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ABSTRACT 

We develop a general framework for quantifying the many different contributions to the noise budget of an image made with an 
array of dishes or aperture array stations. Each noise contribution is associated with a relevant correlation timescale and frequency 
bandwidth so that the net impact in a complete observation can be assessed. All quantities are parameterised as function of observing 
frequency and the visibility baseline length. We apply the resulting noise budget analysis to a wide range of existing and planned 
telescope systems that will operate between about 100 MHz and 5 GHz to ascertain their imaging performance and limitations. We 
conclude that imaging performance is adversely impacted in several respects by small dimensions of the dishes or aperture array 
stations. It will be more challenging to achieve thermal noise limited performance using 15 m class dishes rather than the 25 m dishes 
of current arrays. Some of the performance risks are mitigated by the deployment of phased array feeds and more with the choice of 
an (alt, az, pol) mount, although a larger dish diameter offers the best prospects for risk mitigation. Many improvements to imaging 
performance can be anticipated at the expense of greater complexity in calibration algorithms. However, a fundamental limitation 
is ultimately imposed by an insufficient number of data constraints relative to calibration variables. The upcoming aperture array 
systems will be operating in a regime that has never previously been addressed, where a wide range of effects are expected to exceed 
the thermal noise by two to three orders of magnitude. Achieving routine thermal noise limited imaging performance with these 
systems presents an extreme challenge. The magnitude of that challenge is inversely related to the aperture array station diameter. 

Key words. Instrumentation: interferometers, Methods: observational, Techniques: interferometric, Telescopes 



1. Introduction 

Dynamic range limitations in synthesis imaging arise from three 
distinct categories of circumstance: 1. Instrumental artefacts. 2. 
Imaging artefacts. 3. Incomplete calibration of the instrumental 
response. The first category can be addressed by insuring a linear 
system response to signal levels together with other design mea- 
sures within the receiver and correlator systems that minimise 
spurious responses. While challenging to achieve, the engineer- 
ing requirements in this realm are moderately well defined and 
this class of circumstance will not be considered further in the 
current discussion. Some of the specific relevant effects, such 
as "closu r e erro rs" and quantisation corrections are discussed 
in Perlev (1999) The second category includes a range of ef- 
fects, from the well understood smearing effects that result from 
a finite time and frequency sampling to the imaging challenges 
associated with noncoplanar baselines as discussed in Cornwell 
et al. (20081) The third category is one that is less well under- 
stood and docume nted. Sever al important effects in this area are 
also discussed by iPerlevI (19991), including imperfect polarisa- 
tion calibration, inadequate (u,v) coverage, and numerical mod- 
eling errors. In this note we will attempt to identify and quantify 
the phenomena that influence synthesis image dynamic range 
more generally. 

2. Calibration Errors 

A key factor that determines image dynamic range is calibra- 
tion of the instrumental gain. Instrumental gain, in this con- 
text, should be considered to be the (spatial, polarisation, fre- 



quency) response in (amplitude, phase) as function of time of 
each of the antennas (be they aperture array stations, or the 
phased-array fed- or single pixel fed- dish beams) of a syn- 
thesis array. Traditional array calibration methods, employing 
semi-continuous (minute scale) noise source injection and occa- 
sional (hour scale) observations of well-characterised calibration 
sources routinely achieve a calibration precision at GHz frequen- 
cies of a few percent in amplitude and about 10 degrees of phase. 
This gain calibration precision, 4>c - 0.2, is only sufficient to 
achieve an image dynamic range of less than about 1000:1 in 
full track observations with existing arrays. A ro ugh estimate o f 
the relation between these quantities is derived bv lPerlevh 1999) 
The dynamic range, D, is about, 



D « M° T 5 M° F 5 N/4>, 



(1) 



for Mj independent time and Mf frequency intervals and N 
antennas that each have independent gain errors of magnitude 
(f>. What constitutes an independent time sample depends criti- 
cally on the nature of the gain error together with both baseline 
length and field of view; varying from hours to fractions of a 
second. Similar considerations also apply to what constitutes an 
independent frequency interval. Improvements of the image dy- 
namic range fro m this base level rely on self-calibration. Self- 
calibration (e.g. lCornwell & Fomalontl 1999) consists of the it- 
erative modeling of the sky brightness distribution together with 
the instrumental calibration. Self-calibration only works when 
two conditions are met: 
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1 . There are a sufficiently small number of degrees of freedom 
in this model relative to the number of independent data con- 
straints. 

e-mail: 2. The signal-to-noise ratio in the data for each antenna is suf- 
ficiently high within each solution interval. 
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The number of data constraints (for a given frequency 
channel and polarisation state) is No ~ N 2 /2 over the solu- 
tion timescale over which all model parameters are constant. 
Significant instrumental/ atmospheric/ ionospheric gain fluctu- 
ations typically occur on (sub-) minute timescales. If the sky 
brightness is dominated by a single compact source, then the 
number of degrees of freedom, Nf, is Nf ~ 1 + N, and its 
clear that the system is highly over-determined (No » Nf) 
for the solution of the N antenna gains in a single source di- 
rection within each solution interval. If, on the other hand, the 
sky brightness is characterised by a large number of compo- 
nents, Nc, each of which is represented by multiple variables 
(eg. a peak brightness, two positions, two sizes and an orienta- 
tion) that approaches A^ 2 in number, then its clear that difficulties 
can arise. In that case, it has become customary to assume that 
the spatial instrumental response to the sky brightness remains 
time invariant, so that the total number of degrees of freedom, 
Nf ~ 6Nc + MjN still remains small relative to the complete 
observation that comprises Nj = MjN 1 /2 total independent 
data constraints. We will explore below the circumstances under 
which this assumption breaks down. The requirement of suffi- 
cient signal-to-noise in the data from each antenna has a direct 
bearing on t he precision of the self- calibration solution that can 
be achieved. ICornwell & Fomalontl ( 19991) give an estimate for 
the residual calibration error for a given solution timescale, tj , 
as (p(rs) = <t(ts)/[(N - 3) 05 S] in terms of the fluctuation level 
of a single visibility <x(t.s) and a single dominant, unresolved 
source of flux density, S . In the case of a more complicated sky 
brightness distribution composed of Nc discrete components of 
total flux density STot, this becomes: 

<P(T S ) = cr(T S )N° C 5 /[(N- 3f 5 S rot] , (2) 

since the Nc components will not contribute coherently to the 
net visibility vector. A practical solution timescale in the sin- 
gle, on-axis source case would be tied to the intrinsic fluctuation 
timescale of about a minute, while for the multiple component 
case, it will be determined by the off-axis distance on the sky of 
these components. As we will see below, this will be compara- 
ble to a single integration time; typically one to ten seconds, for 
sources within the main beam of the antenna. When both these 
conditions are satisfied to a sufficient degree, it has proven pos- 
sible to achieve image dynamic range, D = S Max I o~ Map, defined 
to be ratio of peak brightness, S Max, to image RMS, a- M a P , of 
more than one million to one with current arrays after consider- 
able effort by "black belt" level interferometrists. Its vital to note 
that the hig hest dynamic ra nge imaging that has been achieved 
to date (eg. lSmirnovll201 la) is associated with fields dominated 
by particularly bright, yet simple sources. Fields that are domi- 
nated by more typical complex sources have not achieved any- 
thing comparable. 

Rather than specifying some arbitrarily high numerical value 
of the dynamic range as a goal for a particular instrument or 
design, a more useful requirement might be that the instrument 
routinely achieves thermal noise limited performance even after 
long integrations. Since the noise in a naturally weighted synthe- 
sis image is simply related to the visibility noise as, 

crMa P = airs )/[M T M F N(N - l)/2] a5 , (3) 

we can assess individual contributions, denoted by subscript i, to 
the final image noise by considering their magnitude on a self- 
cal solution timescale, o"/(ts), together with the values of Mj-, 
and Mpi which apply to each particular contribution. It is im- 
portant to stress that the equivalent image noise calculated in 



eqn.[3]will only represent an actual image noise when it relates 
to objects within the image field of view. For effects that per- 
tain to sources beyond the field of view the influence is more 
indirect as will be discussed in detail below. However, when 
self-calibration is employed, even the out-of-field error contri- 
butions will result in this magnitude of image fluctuation due to 
their adverse impact on the self-calibration error. Equation[3]will 
be used to track the magnitude of the different contributions to 
self-calibration error while taking into account the very different 
correlation timescales and bandwidths that apply to each. 

As noted above, a major simplifying assumption that is of- 
ten implicitly, if not explicitly, invoked is that of a time-invariant 
observed "sky" during the period of data acquisition. The under- 
lying assumption is that neither the radio sky nor the (spatial, po- 
larisation, frequency) components of the instrumental response 
vary with time and that only a single (amplitude, phase) term is 
sufficient to capture the time variable nature of the instrumental 
response. What if this is not the case? 

2.1. FarSidelobes 

The assumption of a stationary sky brightness will break down 
in practise for several reasons. Firstly, there is the residual all- 
sky response due to the far sidelobe pattern of the antenna. The 
far sidelobe attenuation of an antenna relative to the on-axis re- 
sponse, £f, is given approximately by the ratio of the effective 
area of a single dipole to that of the entire antenna, 

e F = rf F (A/d) 2 , (4) 

with a proprotionality constant tjf that is tied to the quality of 
aperture illumination and varies between about 0.1 < 7]f < 1 
in practise. The (A/d) 2 dependence is a manifestation of physi- 
cal optics and applies to both blocked and unblocked apertures. 
Far sidelobe levels have been accurately deter mined fo r the 2 5 
m diameter VLBA antennas at 1717 MHz bv iDhawanl d2002h . 
Peak sidelobe levels of about -53dB were measured at 30-110 
degrees off-axis, yielding rjp ~ 0.1. Dhawan (2002) note that 
while these VLBA levels are in good agreement with theoreti- 
cal expectations, the values for VLA antennas were about 6 dB 
higher at that time (tjf ~ 0.4), likely due to nonoptimum illumi- 
nation by the 20cm feed lens system that has since been replaced 
in the JVLA upgrade. The prime focus fed 25 m diameter dishes 
of the WSRT have measured far sidelobe levels at A =49 and 
92 cm that are consistent with t\f ~ 0.1 (lBraunlll993h . All ra- 
dio sources above the horizon are attenuated by only this factor 
when they appear in a far sidelobe peak of the antenna. As the 
mainbeam of the antenna is used to track a source on the sky, 
the apparent brightness of the sky will vary due to the rising and 
setting of specific sources relative to the antenna horizon, but 
also due to changes in the far sidelobe response including, but 
not limited to, rotation of that pattern relative to the sky for an 
antenna mount that does not compensate for parallactic angle. 
The far sidelobe voltage response pattern of an antenna consists 
of local maxima and minima that are comparable to the main 
beam in size and separated by nulls. The basic pattern will have 
a radial frequency scaling, but may also contain additional fre- 
quency dependence, for example when multi-path propagation 
conditions are present. 

The properties and density of radio sources are summarised 
in Table [D Values a re based on the NRAO VLA Sky Survey 
dCondon et al.l 19981) for source densities and median flux den- 
sities in each bin at GHz frequencies, and Windhorst (20031) for 
the median angular size and spectral index. Some care needs to 
be exercised with the interpretation of the angular size, since in 
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Table 1. Statistical source properties of the extragalactic sky at 
1.4 GHz in bins of 0.5 dex. 



Si AGHz 


Ml/2 


N(>S) 


FWHM 


Spectral Index 


(Jy) 


(Jy) 


(sr 1 ) 


(arcsec) 




30 


272 


2.4 


160 


-0.80 


10 


14 


5.7 


100 


-0.85 


3 


4.2 


33 


25 


-0.87 


1 


1.4 


203 


15 


-0.90 


0.3 


0.42 


1360 


10 


-0.87 


0.1 


0.14 


5920 


8 


-0.83 


0.03 


0.042 


21700 


5 


-0.80 


0.01 


0.014 


56100 


4 


-0.75 


0.003 


0.0042 


136000 


3 


-0.70 



many cases this refers to the separation of more compact com- 
ponents, rather than to the effective diameter of a diffuse source. 
This is particularly true for the brightest GHz source counts that 
are dominated by luminous classical double sources down to 
about 30 mJy. Below about 30 mJy, the source counts begin to 
be dominated by lower luminosity, edge-darkened radio galaxies 
and below about 0.1 mJy by star forming galaxies (e.g. Wilman 
et al. 2008) 

A good indication for how the flux density of high luminos- 
ity radio galaxies varies with scale is given in Figure Q] showing 
the power spectrum of Cygn us A based on a VL A A+B+C+D 
configuration 6 cm image of Perl ev etaD ( 1984) The solid line 
is the average normalised power as function of inverse scale, 
deconvolved for PSF tapering in the observation based on the 
short-dashed curve. The long-dashed curve is a power law of 
slope -0.75 which provides a reasonable representation of the 
trend for scales up to 30 times smaller than the source size. The 
Cygnus A power spectrum can be well modeled by the sum of 
four Gaussian components as shown by the dotted curve that 
has relative angular scales of (1: 0.29: 0.039: 0.0067) and am- 
plitudes of (0.648: 0.237: 0.107: 0.008). From the combination 
of number densities and source sizes in Table 1 together with 
the power spectrum in Figure [1] one can construct a compos- 
ite fluctuation spectrum due to the high luminosity extragalac- 
tic background sources within 2n steradians. This is shown in 
Figure [2] as the solid line, while the short-dashed curves are the 
contributions from the individual flux bins down to 30 mJy from 
Table 1 . Within each flux bin, we form the incoherent sum of the 
relevant sources by taking the square root of the source number 
multiplied by the median flux of that bin. Similarly, the compos- 
ite fluctuation level is taken to be the square root of the sum of 
the squares of the bin contributions. This is intended to describe 
the net composite fluctuation level in an interferometric visibility 
for a given baseline length. The net composite fluctuation level 
is well fit by a power law of slope -0.85 in baseline length from a 
base level of 450 Jy at 1 km as shown by the long-dashed curve. 

The other important parameter to quantify is the effective 
number of model components that would be needed to represent 
the sky brightness distribution. Within each flux bin of Table Q] 
we have taken the number of sources within 2n steradians and 
scale this with the inverse of the normalised power spectrum 
shown in Figure Q] In this way, as each source becomes re- 
solved it is distributed into a proportionally larger number of 
fainter components. We then form the weighted sum of compo- 
nents over flux bins using a weighting factor equal to the median 
flux of that bin. The total effective component number is shown 
as the solid line in Figure [2] while the individual bin contribu- 
tions are plotted as the short-dashed lines. As expected, only a 
few hundred components dominate the sky on all baselines be- 



tween about 0.5 and 50 km. On longer baselines the total number 
increases approximately as a power law of baseline with slope 
~0.4 from a base level of 100 on 1 km baselines as shown with 
the long-dashed curve. 

With the -50dB peak far sidelobe level of a 25 m dish ob- 
serving at 25 cm, one hemisphere of the extragalactic radio sky 
would contribute some 5 mJy of noise-like fluctuations to visi- 
bility measurements on 1 km baselines. Adequate modeling of 
these sources would require about 100 components on this same 
scale. On 10 km scales such fluctuations will have declined to 
the 0.7 mJy level and would require some 100's of components 
to model. 

The Sun is the other major contributor to residual visibil- 
ity fluctuations when not adequately modeled. We plot the rel- 
ative visibility amplitude of the Sun in Figure [1] based on 6cm 
VLA mosaic images kindly provided by White (2002) and 2012, 
private communication). The two observing dates correspond 
to relatively active and quiet portions of the Solar cycle. The 
Solar power spectrum can be well modeled by the sum of three 
Gaussian components, as shown by the dotted curve, that have 
relative angular scales of (1: 0.0625: 0.0156) and amplitudes of 
(0.9645: 0.0302: 0.0053). The corresponding visibility "noise" 
and component number are plotted in Figure [3] as function of 
baseline length. The visibility modulation level was taken to be 
the power spectrum of Figure Q] divided by the square root of 
component number to account for the fact that the individual 
solar flux components will not sum coherently. The 1 .4 GHz in- 
tegral flux on these dates has been scaled to an assumed total 
of 100 sfu on 99 APR 11 and 50 sfu on 93NOV06 in line with 
the daily solar monitoring data from those epochs. With 50dB of 
attenuation, there is still a residual fluctuation level of about 10 
mJy on 1 km scales that would require about 100 components 
for adequate modeling. 

2.1.1. Radio Frequency Interference 

Another contribution to the far sidelobe response from the tele- 
scope environment is that of radio frequency interference (RFI) 
from ground-based, aerial and space-based sources. Clearly, the 
greater the degree of attenuation outside of the main beam, the 
more straightforward it becomes to deal with the power levels 
of non-astronomical transmissions. In view of the great diver- 
sity of RFI attributes, we will not attempt to characterise them 
here. We simply note that adequate RFI recognition, excision or 
mitigation will be essential to achieving thermal noise limited 
performance. There are many potential telescope locations and 
frequency bands in which RFI will preclude high performance 
imaging. 

2.1.2. The Ionosphere and Atmosphere 

A significant complicating factor in wide-field imaging is the 
direction dependent nature of propagation effects relative to a 
ground-based telescope array. The diffracti ve and refractive ef - 
fects of t he ionosphere and troposphere (Inte ma""et alJ (2009) 
ISmirn ov (2011a)) make it necessary to determine distinct cali- 
bration solutions for sources separated by more than about 1 de- 
gree and timescales of about 1 minute. The situation is further 
complicated for an array that is spatially extended relative to the 
physical size of density fluctuations, since different telescopes 
then have distinct propagation paths through the medium and a 
distant thin screen approximation is no longer valid. The rele- 
vant size scale for traveling ionospheric disturbances (TIDs) is 
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Fig. 1. The relative visibility amplitude of Cygnus A (left) and the Sun (right) are shown as the solid curves. The short-dashed curve 
shows the point spread function of the observations, while the long-dashed curve a power-law approximation. The dotted curve is 
a sum of four (left) and three (right) Gaussians that approximates the measured spectra. Both the active (1999 APR! 1) and quiet 
(1993NOV06) Sun are shown. 




Fig. 2. The incoherent visibility "noise" due to 2n steradians of the extragalactic sky (left) and the corresponding number of flux- 
weighted components (right) as function of baseline length (angular scale). Visibility "noise" is completely dominated by the 
brightest source bin out to baselines of about 30 km that would require several hundred components to adequately model. 



a few km. While we will not attempt to quantify the impact of 
residual solution errors, we note the necessity for distinct, time 
and position dependent calibration solutions due to these effects. 



2.1 .3. The Role of a Global Sky Model 

One might consider that a major aid to the self-calibration prob- 
lem would be the provision of a high quality model of the entire 
radio sky at each frequency of an observation. To the extent that 
intrinsic source variability or propagation effects (in the inter- 
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Fig. 3. The visibility "noise" due to the active and quiet Sun (left) and the corresponding number of components (right) as function 
of baseline length (angular scale). 



stellar, interplanetary, ionospheric or tropospheric media) were 
negligible this might eliminate many degrees of freedom in the 
self-calibration solution. While undoubtedly of great benefit, a 
global sky model will only partially reduce the number of de- 
grees of freedom in the solution (source positions), since a re- 
maining unknown is generally the instrumental response in the 
direction of each source at each solution time. The accuracy with 
which far sidelobe responses of an antenna system can be pre- 
dicted at any specific location is typically no better than a fac- 
tor of two, implying that this prediction has little value in con- 
straining a solution for the apparent brightness of each source 
within an observation. Although very significant effort is being 
expended to improve the pr ediction of far sidelob e responses in 
the aperture array domain (Wiin holds et al.l 20101) it is not yet 
clear how effective these will be in practise. 

In the event that the global sky model also incorporates an 
extremely high fidelity, high resolution representation of every 
relatively bright source in the sky, it would provide an invaluable 
resource for accurate self-calibration. As will become apparent 
subsequently, the accurate modeling of the random sources that 
happen to occur within the field of view is often the primary lim- 
itation to the dynamic range that can be achieved. While this has 
been put forward as the preferred calibratio n method for surv ey 
projects such as planned with LOFAR (eg. lHeald et al.l 2010l) in 
practise it is likely to be the ultimate outcome of a multi-year 
data acquisition and source modeling effort, rather than a conve- 
nient starting point for a random field of interest. 

2.1.4. Time and Bandwidth Smearing/Correlation 

The visibility response to distant off-axis sources will be mod- 
ified by several effects, most notably time- and bandwidth- 
smearing for the finite integration times and frequency resolu- 
tion used in an observation. This can be viewed both positively 
and negatively. The peak visibility response will be diminished 
by these smearing effects and this will be beneficial in reduc- 



ing noise-like modulations. However, if the source is still bright 
enough that it must be modeled, then the smearing will adversely 
impact the quality of that modeling and contribute to residual 
gain errors in the calibration. The angular extent of time smear- 
ing of the response due to earth rotation is given approximately 
by, 

6 T = arr s 0, (5) 

for an angular rotation rate w, (15 deg/hr) an integration time, ts , 
and an offset angle from the phase centre, 8. For a source on the 
celestial pole viewed by an east-west baseline, this smearing is 
along a circular track, while for other circumstances the smear- 
ing direction is more complex. The bandwidth smearing has an 
extent, 

5 V = 8Av s fv, (6) 

for a fractional bandwidth Av,s /v and is in a radial direction. For 
example, an unresolved source that is offset from the pointing 
direction by 1 radian that is sampled with an integration time of 
1 second with a fractional bandwidth of 10~ 4 will suffer about 15 
arcsec of both time smearing and bandwidth smearing. The spe- 
cific choice of integration times and averaging bandwidths is de- 
termined by the need to keep these effects at the level of a small 
fraction, rjs , of the synthesized beamwidth within the antenna 
main beam. In terms of a dish/station diameter, d, and array of 
baseline, Bmox, this implies an integration time no longer than, 

t* < nsdlicoBuax) = 1-38 x lQ%dlB Max (sec), (7) 

and a fractional bandwidth no larger than, 

Av*/v < J] S d/B Max . (8) 

For example, for rjs — 0.1, d — 25 m and Bm c ,x = 25 km, the re- 
quirements are Av*/v < 10~ 4 and t* < 1 sec. Exactly the same 
considerations can be used to evaluate the likely time and fre- 
quency intervals over which particular types of calibration errors 
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might be correlated. If the calibration error relates to a source 
that is near the half power point of the main beam, this corre- 
sponds to 6 T = 6 V = A/B and 9 = A/ (2d) in eqns.[5]and [6j which 
will be correlated over time intervals of, 



T*2dl(u>B)= 1.38 x 10 4 2d/fi (sec), 
and bandwidths, 
Av/v * 2d/B. 



(9) 



(10) 



For example, for d — 25 m and Bmux - 25 km, the calibration 
errors associated with sources near the half power point of the 
main will be correlated over Av/v ~ 2 x 1CT 3 and r « 28 sec. In 
the far sidelobe regime, with 9-1 radian, the correlation time 
reduces to, 



r * A/(coB) = 1 .38 x 10 4 A/B (sec), 
and bandwidth to, 
Av/v * A/B. 

2.1.5. Visibility Consequence 



(11) 



(12) 



The consequence of unmodeled off-axis sources in the visibility 
data is a modulation of the visibilities with a magnitude equal 
to that of the unmodeled source. Unresolved and unsmeared 
sources will have the same modulation amplitude on all observed 
baselines, while resolved and/or smeared sources will have a 
modulation amplitude that decreases with increasing baseline 
length. Since these are periodic modulations, with a modulation 
wavelength, m = A/ 8, that is very short for large pointing off- 
sets, they will partially cancel when averaged over finite time 
intervals or within gridded visibility cells. However, the pres- 
ence of such residual modulations will negatively influence the 
quality of the self-calibration that can be achieved, particularly 
on relatively short timescales over which the cancellation will be 
increasingly imperfect. In the case where successive integration 
times sample visibilities that are offset from one another by a 
significant fraction of A/ 6, i.e. baselines of length greater than, 
B > ABMax/(i]s6d), or about a km for GHz observations with 9 = 
1 radian, such unmodeled sources will add a noise-like contribu- 
tion in quadrature to the visibility noise ct(t) and directly impact 
the calibration precision </>, and ultimately the final image noise 
level. 

We have attempted to verify the analytic estimates of far 
sidelobe effects by simulating model visibilities as observed by 
the VLA C configuration at 1.4 GHz. All radio sources within 
the NVSS database brighter than 10 Jy and above the local VLA 
horizon at LST 12:00:00 were added to an othe rwise empty visi- 
bility file of 12 hour duration using the miriad (ISault e t al. 1995) 
task uvgen. Each catalogued source was assumed to be com- 
posed of the sum of four Gaussian contributions with angular 
sizes of (1: 0.29: 0.039: 0.0067) relative to the tabulated FWHM 
and amplitudes of (0.648: 0.237: 0.107: 0.008) relative to the in- 
tegrated flux density. The response was calculated relative to a 
pointing centre at (RA, Dec)=(12 /! , 34deg) both with and with- 
out a three Gaussian component model of the Sun (assumed to 
be at (RA, Dec) = (9 h , -11 deg) when present) as illustrated in 
Figure [3] The intrinsic fluctuation levels are indicated by the 
solid curves in Figure |4] The residual fluctuations including the 
smearing effects of a 10 sec integration time and Av/v = 10~ 3 
frequency sampling are indicated by the dashed curves. (We note 
that since the uvgen task does not provide explicit support for 
time and bandwidth smearing, these effects were introduced by 



generating a highly oversampled simulation that was vector av- 
eraged appropriately after calculation.) The intrinsic fluctation 
spectra are flatter by unit slope compared to those which include 
both smearing effects, as expected. No primary beam attenua- 
tion has been applied. Representative power law approximations 
of the post-smearing fluctuation levels are, 
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(14) 



with the Sun present. The ep term represents the far side-lobe 
response from equation [4] The extragalactic contribution scales 
roughly as v~°- 8 , given the steep spectra of the brightest sources, 
while the Solar spectrum at GHz frequencies is relatively flat. 
The baseline length is normalised with B^ = (v/1.4 GHz) 1 km. 
These measured fluctuation spectra are quite similar to those ob- 
tained by the analytic estimate shown previously in Figure [2] 
and [3] although with a slightly flatter slope when the individ- 
ual source attributes and positions are employed rather than the 
tabulated binned source properties. Clearly, the daytime power- 
law approximation is only valid for baselines less than about 
B/Bk = 3, where the Solar contribution dominates over that of 
the extragalactic sky. 

An important aspect of the far sidelobe contribution to visi- 
bility "noise" is that it is essentially uncorrelated between adja- 
cent frequency channels and time stamps. Eqns.[T2]and[TT]yield 
Av/v ~ 10~ 4 and t m 1.5 sec at a typical off-axis distance of 
1 radian. While this noise contribution may adversely influence 
individual solutions, at least it will average down substantially 
during long tracks and over significant bandwidths. The relevant 
values of Mj t and Mr ,• are given simply by the total number of 
solution intervals in time and frequency, as, 



M TF = t t /t s 
and 



(15) 



(16) 



M FF = Av T /Av s 
for a total observing time tj and bandwidth Avj. 

2.1.6. Image Consequence 

The image plane consequence of unmodeled sources is influ- 
enced by some additional factors. Sources that lie beyond the 
edges of a synthesis image will be aliased back into that image. 
The degree of suppression of such aliased responses is deter- 
mined by the convolving function that is used to grid the visi- 
bility data. This constitutes a degree of averaging of the highly 
modulated visibilities (as discussed above) to provide partial 
cancellation of off-axis responses. Gridding functions that are 
currently used in synthesis imaging provide about 40 dB sup- 
pression of the aliased response in the far sidelobe regime (e.g. 
iBriggs et a l. 1999) What the gridding function will not suppress 
is any synthesized beam sidelobes of distant sources that would 
naturally occur within the image. Since the apparent brightness 
of a far sidelobe source will often be highly time variable, it is 
the instantaneous synthesized beam quality which becomes im- 
portant in this regard. In the most extreme case of a one dimen- 
sional array configuration, the instantaneous synthesized beam 
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Fig. 4. Measured visibility "noise" due to the NVSS > 10 Jy sky as seen by the VLA C configuration without primary beam 
attenuation (left). The two solid curves show the fluctuation level with and without the inclusion of the Sun. The two dashed curves 
include the effects of time and frequency smearing and are overlaid with representative power-law approximations. Visibility "noise" 
per solid angle due to the NVSS sky (right). The RMS contribution of NVSS sources, normalised to a one square degree field of 
view, is plotted as function of baseline length. Flux bins from Table Q] are shown individually with the labelled curves. The root 
squared sum of contributions of the bins < 10 Jy is shown as the solid line, together with an overlaid powerlaw fit. 



far sidelobes can have amplitudes of 100% so that unmodeled 
sources can add spurious structure to the image with very lit- 
tle suppression beyond that provided by the main beam. Even 
in the case of a regular three arm configuration such as used in 
the VLA, the instantaneous synthesized beam has peak sidelobe 
levels of 10's of percent. What prevents this factor from becom- 
ing dominant in most broad-band continuum applications is the 
radial frequency scaling of the synthesized beam pattern, includ- 
ing its sidelobes. The sidelobe pattern will partially cancel in the 
frequency averaged response. 

2.2. Near-in Sidelobes 

In addition to the issue of far-sidelobes, is that of the near-in 
sidelobe pattern of the antenna response. Any aperture blockage 
and non-zero illumination taper at the edges of the antenna will 
give rise to near-in sidelobes in the antenna response. For a uni- 
form aperture illumination, such as might be employed for an 
aperture array station, these will have a peak value, e$ - 0.1, 
while the more tapered, but "blocked" illumination patterns of 
prime focus and symmetric Cassegrain-fed dishes yield peak 
near-in sidelobes of about e$ = 0.02. The unblocked 100 m GBT 
apert ure has near-in peak sidelobe levels of less than 1% at 1.4 
GHz (iRobishaw & Heilesl 20091) although these have significant 
azimuthal modulation. 

Near-in sidelobes complicate the calibration and imaging 
problem in a number of ways. Significant sidelobe responses 
will increase the region of sky that must be modeled with high 
precision to enable self-calibration. The increase in the num- 
ber of degrees of freedom that are absorbed by sky compo- 
nents may then limit the precision of the self-cal solution as 
outlined above. Since these components are significantly further 



off-axis than main-beam components, they will also require pro- 
portionally shorter solution timescales to be employed, which 
impacts the solution precision via the visibility signal-to-noise 
constraint. This is particularly problematic if the sidelobe pattern 
is not fixed on the sky during source tracking, since one then en- 
counters the under-determined self-cal solution condition more 
rapidly. Som e deconvolution aspect s of this problem have been 
addressed bv lBhatnagar et al] (2008) with extensive modeling of 
both the VLA main beam and its near-in sidelobe and polarisa- 
tion response, including its changing orientation relative to the 
sky. A self-cal implementation has been described in Bhatnagar 
et al. d2004h . When it proves too challenging to model sources 
occuring within the near-in sidelobes they will yield a residual 
visibility modulation of, 



o~s — e sS rms(Qs) 



(17) 



in terms of the net RMS source brightness, S rms > within the area 
of the sidelobe pattern, Q5 . The net visibility fluctation level due 
to the NVSS source population per unit area is shown in the 
right hand panel of Fig. [4] This plot is similar to Fig. [2] but is 
calculated for a reference solid angle of 1 deg 2 rather than 2n sr, 
and the summation excludes the two brightest flux bins so that 
it better represents a random, but "quiet", piece of the sky that 
does not include the 100 or so brightest sources per hemisphere. 
A power-law representation of the net visibility fluctuation level 
is given by, 



1 RMS 



650 Q 



650 Q 




(B < B s ) 



(18) 
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where the RMS brightness scales roughly as v -0-8 , and the 
baseline length is normalised to the break at about B$ = 
3(v/1.4 GHz) -1 km, below which the RMS flux saturates. Of 
note in equation[18]is the square root dependence on solid angle, 
which accounts for the incoherent vector summation of sources 
in the field of view. In cases where the main beam field of view 
exceeds about 200 deg 2 , the "quiet" sky approximation is clearly 
no longer relevant, and the complete source population must be 
considered yielding, 



7500 a 



7500 Q 




0.8/ „ N -0.85 
a 



(B<B' S ) (19) 
(B>B' S ) mJy, 



with B' s 



0.35(v/1.4GHz)^' km. As will be seen below, the 
values of S rms an d S' RMS are intermediate between the peak 
brightness, S Max, and the integrated brightness, Sjot, within a 
field of size Cldeg- 

The telescope main beam has its first diffraction null at a 
radius of l.22A/d radians which is also approximately the main 
beam FWHM. This corresponds to a main beam solid angle, 



Q M = n(l.22A/df/4. 



The first diffraction sidelobe extends from radius (l.22A/d 
2 AAA Id) and so has solid angle of, 



Q. s = 3Q.M = 3n(l.22A/df/4. 



(21) 



The correlation timescale and bandwidth for these modula- 
tions due to sources offset from the pointing direction by 8 — A/d 
will be t ~ d/((L>B) and Av/v as d/B, yielding, 



Mrs = 



TjB 



1.38 x 10 4 c/ 



and, 



M, 



rs 



Av T B 
vd 



(22) 



(23) 



Near-in sidelobe patterns of dishes have been shown to have 
substantial frequency modulation at the 10% level (Popping & 
Braun 2008) with a periodicity of about f$ ~ c/2lc Hz, for a ver- 
tex to prime focus "cavity" separation, Iq. For Cassegrain optics 
the relevant separation, lc, appears to be approximately that of 
the primary and subreflector, while for offset Gregorian designs 
it is that of the feed and subreflector. The correlation bandwidth 
will be about half a period, Av ss c/(4lc) which yields, 



M' FS = 4l c Av T /c. 



(24) 



The larger of the two values given by eqns.l23landl24lprovides 
the better estimate of the independent sample number in the full 
observation. 



2.3. Main-Beam Issues 

There are several ways in which attributes of the main beam of 
the antenna will influence image dynamic range. In the case of 
aperture arrays, it is conceivable that the antenna main beam 
may be continuously changing as a field is tracked on the sky 
due to the changing geometric foreshortening of the aperture. In 
this case, the main beam sky brightness model will be highly 
time variable and one will need to operate essentially in the 
snap-shot self-cal mode, without the benefit of the factor of 



Mj additional data constraints that come with a time-invariant 
main beam model. In the event that different aperture array main 
beams were to apply to different antennas (as implemented for 
the three variants of the LOFAR high band antennas) this is fur- 
ther complicated, since there are only smaller subsets of the vis- 
ibilities (e.g. A^ 2 /8) for which the same snap-shot sky bright- 
ness model would apply, greatly diminishing the number of rel- 
evant data constraints relative to sky brightness components. A 
means of compensating for different main beam types as well as 
main beam variabili ty has been termed the A-Projection method 
and is decribed by [Rau et all (2009) The method employs a 
gridding convolution function for the visibilities that compen- 
sates for modeled main beam properties. This provides an ex- 
act method for the "reverse" calculation of visibilities given a 
sky model, but only an approximate "forward" calculation of the 
dirty image from the acquired data. The degree of utility of the 
"forward" correction depends critically on the completeness of 
(u, v) coverage within the support of the visibility convolving 
function that applies to an individual main beam type. In the 
limiting case, where highly isolated (u, v) samples pertaining to 
a given main beam type are gridded, the convolving function will 
have no net correcting effect on the dirty image. 



(20) 2.3.1. Pointing Errors 



A second important class of main beam influences relates to the 
antenna pointing accuracy (either mechanical for dishes or elec- 
tronic for aperture arrays). Pointing variations will introduce a 
strongly time variable component to the radio sky brightness 
model for components that are significantly off-axis. If the sky 
brightness model is dominated by such widely distributed com- 
ponents, then pointing variations may again push one into the 
snap-shot self-cal mode. For the simplest form of pointing er- 
ror, that preserves main-beam shape, the model variations could 
in principle be captured with only two additional degrees of 
freedom per antenn a (the two perpendicular pointing offsets). 
Bhatnagar et al. (2008) have also considered this class of error 
in their study. Pointing errors that are a fraction, Bp, of the main 
beam of size, P - Bp A/d radians, are most serious where they 
introduce the strongest absolute modulation in apparent source 
brightness. For a Gaussian approximation to the main beam, 
M = exp(-O.5(0/s) 2 ), the maximum spatial derivative occurs 
at a bore-sight offset equal to the dispersion, s = A/ (2d), and has 
amplitude exp(-0.5)/s rad -1 , giving an amplitude modulation, 
ep = Bp A exp(-0.5)/(sd) = 2Bpexp(-0.5) « Bp. The maximum 
amplitude modulation expected on the flanks of the main beam 
is thus comparable to the pointing error expressed as a fraction 
of the main beam size, ep ~ Bp. Actual main beam shapes can 
only be well-fit by a Gaussian out to about 10% of the peak re- 
sponse. Beyond this radius lies a deep response minimum fol- 
lowed by the near-in sidelobes. The main beam power pattern of 
an interferometer arises from the product of the complex voltage 
response patterns of each pair of antennas. The deep response 
minimum in the power pattern corresponds to the region where 
the sign of the voltage pattern of each antenna changes from pos- 
itive to negative. A uniformly illuminated circular aperture has a 
voltage response pattern given by V(x) = 2J\(x)/x, where Ji(x) 
is the Bessel function of the first kind of order 1. Evaluation 
of the radius at which the product of sky area (nx 2 ) with V(x) 
with the derivative of V(x) is maximised yields a similar result 
to that found above for the Gaussian approximation. The largest 
and most probable absolute modulation still occurs on the main 
beam flanks (at about 20% of peak power in this case) and has 
amplitude comparable to the pointing error expressed as a frac- 
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tion of the main beam size, <pp ss ftp. The resulting visibility 
modulation due to mechanical pointing errors is given by, 

(T P = 0.7 PdS RMS (il M )/A (25) 

after scaling with the RMS source brightness that occurs within 
the main beam, S rms (Qm), and a typical attenuation on the main 
beam flank of 0.7. The same analysis will also apply to the case 
of variations in the main beam shapes for different stations in 
an array. The failure to account for such variations that are a 
fraction, ftp, of the main beam size will give rise to the same 
magnitude of residual visibility based errors. 

An important attribute of pointing errors is that they are 
likely to be intrinsically broad-band in nature and so will not 
diminish significantly with frequency averaging. The correlation 
timescale of pointing errors will depend on their nature. Errors 
due to wind buffeting will have characteristic timescales, t>, of 
seconds while errors in the global pointing model or due to dif- 
ferential solar heating are likely to be correlated over many min- 
utes to hours. Estimates for the independent time and frequency 
intervals are therefore simply, 



Mjp — tt/tp 



and, 



M, 



FP 



1. 



(26) 



(27) 



In the event that there are also electronic pointing errors of mag- 
nitude, e' p , and correlation timescale t' p , these will yield an addi- 
tional error contribution with, 



cr'p - Q.le' P S rms(Qm), 
M' T p = Tt/t'p, 
and, 

M' FP = 1. 

2.3.2. Polarisation Beam Squint and Squash 



(28) 
(29) 



A third main-beam issue is that of the wide-field polarisation 
response and its possible variation with time. Polarisation cali- 
bration, in its traditional form, calibrates only the relative gain 
and apparent leakage of polarisation products in the on-axis di- 
rection. Any differences in the spatial response patterns of the 
two perpendicular polarisation products that are measured by 
the antenna will translate into so-called "instrumental polarisa- 
tion" in off-axis directions. One of the more e xtreme examples 
of this phenomenon is termed "beam squint" (lUson & Cotton! 
2008) which occurs in cases where the feed does not illumi- 
nate a parabola symmetrically. This condition even applies to 
the Cassegrain optics of the JVLA, since the feeds are not lo- 
cated on the symmetry axis of the main reflector, resulting in 
RR and LL beams that are similar in size and shape, but are sig- 
nificantly offset from one another, by 5.5% of the main beam 
FWHM, on the sky. Since this offset is introduced by the an- 
tenna optics, it rotates on the sky during source tracking of the 
(alt, az) mount. In this case, the sky brightness model used for 
self-calibration will need to be similarly time variable. Since 
this variability is systematic in nature it could in principle be 
modeled a nalytically, and this ha s been undertaken with some 
success bv lUson & Cotto n (2008) The beam squint can be min- 
imised in secondary focus systems by satisfying the condition 
(Mizugutc h et all 19761) 



tana 



|1 -e 2 |sin/3 
(1 + e 2 )cosy8 - 2e ' 



where a is the tilt angle between the subreflector and the feed, 
/3, the tilt angle between the subreflector and main reflector, 
and e, the eccentricity of the subreflector. This condition forms 
the basis for current offset Gregorian designs, including that 
of the GBT, and delivers very good performance, with cross- 
polarisation amplitudes of better than -40 dB when the subre- 
flector size exceeds about 25/1. For smaller subreflector dimen- 
sions, the polarisation performance is diminished by diffractive 
effects (Adatia 1978) increasing to about -32 dB for a 10/1 sub- 
reflector. The corresponding measur ed values of beam squint for 
the GBT dRobishaw &H eiles 2009) of about 1.5 arcsec at 1420 
MHz, represents a 0.3% FWHM offset at a frequency where the 
7.5 m secondary is about 38/1. 

Even for fully symmetric optics designs, there will inevitably 
be significant differe nces in the polarisation beams from single 
pixel fed dishes (e.g. lPopping & Braunl 20081) with the XX and 
YY beams having an elliptical shape with a perpendicular rela- 
tive orientation that has been termed "beam squash". Ellipticities 
of between 3 and 4% are measured for the XX and YY WSRT 
beams at 1.5 GHz. Beam squash of the linear polarisation prod- 
ucts for the off-axis GBT design has been measured to be about 
20 arcsec between 1200 and 1800 MHz dHeiles et all 20031) or 
about 4% FWHM. In the case of an equatorial (WSRT) or paral- 
lactic axis (ASKAP) mount the instrumental polarisation pattern 
will at least be fixed relative to the sky, so that additional de- 
grees of freedom (or modeling complexity) are not introduced 
by time variability. Aperture arrays and phased array feeds offer 
interesting prospects for greatly suppressing instrumental polar- 
isation via the choice of beam-forming weights that accurately 
match the spatial response patterns and relative alignment of the 
two polarisation beams. When unmodeled beam squint or squash 
of magnitude, €q, is affecting an observation it will contribute a 
visibility modulation of about, 



(30) o-q = 0.1e Q S RMS (£l M )- 



(32) 



The rate of change of parallactic angle with hour angle for an 
(alt, az) mount depends on the source declination, but in general 
the rate is about twice the sideral one, so for a source on the 
main beam flank the number of independent time samples will 
be about, 



Mjn = 7, 

y £/1.38xl0 4 

while this effect is fully coherent with frequency, 
M FQ = 1. 



(33) 



(34) 



(31) 



2.3.3. Multi-path Frequency Modulation 

Finally is the issue of frequency modulation of the main beam 
response pattern. Just as noted above for the near-in sidelobe 
pattern, the main beam dimensions of symmetrically fed dishes 
have been shown to be modulated at about the eg = 0.05 level 
(e.g. Popping & Braun 2008) with a periodicity of about c/2lc 
Hz. What is particularly pernicious about this phenomenon is 
that it represents a position dependant amplitude response which 
is not removed by an on-axis calibration of the instrumental 
bandpass. While the iPopping & Brau n (2008) result was doc- 
umented for the prime focus WSRT dishes, a comparable phe- 
nomenon is seen in Cassegrain systems. We illustrate the nor- 
malised gain of the 22 m diameter, Cassegrain focus, ATCA 
system in the 2-3 GHz band in Figure [5] What's shown is the 
ratio of the cross-correlated amplitude toward a compact cali- 
bration source, PKS 1934-638, normalised by the square root of 
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2.4. Synthesized Beam Issues 



.2 1 




0.9 



2.2 2.3 
Frequency (GHz) 



Fig. 5. Normalised gain of the ATCA 22m antennas as function 
of frequency. The solid curve is the mean gain over all interfer- 
ometer baselines while the dashed curve is a sinusoid with 20.5 
MHz periodicity and 5% peak modulation about the mean. 



the product of the two relevant au to-correlations for each inter- 
ferometer baseline. As shown in Popping & Braunl (20081) this 
quantity is proportional to the interferometer gain and is tied 
to a comparable modulation of the main beam effective area. 
Even in the case of the off-axis optics of the 100 m Green Bank 
Telescope, there are a wide range of quasi-sinusoidal ba seline 
effects that have been documented by Fish er et al.l ([2003). The 
most significant of these is consistent with multipath effects be- 
tween the subrefiector and feed horns and has an amplitude at 
1.4 GHz of eg = 0.005 and frequency period c/2lc Hz, for a 
subrefiector-feed separation, Iq. The visibility modulation of this 
effect is given by, 



<r B = 0.7 e B S rms (&m) 



(35) 



for a typical attenuation of 0.7 on the beam flank, while the num- 
ber of independent time samples is 



M- 



T jB 



TB 



2<il.38 x 10 4 ' 



(36) 



As discussed above for the near-in sidelobes, there are two rele- 
vant frequency modulation intervals to consider, corresponding 
to, 



M FR = 



and 



Av T B 
2vd ' 



M' = Al c Av T lc. 



(37) 



(38) 



The larger of these two values would apply to the full observa- 
tion. 



Some of the influences of the synthesized beam have already 
been mentioned previously; in particular the far sidelobes of the 
instantaneous synthesized beam that will be present in an im- 
age, due to the residual response of all inadequately modeled 
sources on the sky. The frequency scaling of the far sidelobe pat- 
tern provides significant attenuation of these fluctuations once 
broad-band frequency averaging has been applied. Another is- 
sue in this area is the quality of (w,v) coverage in relation to 
the sky brightness distribution that requires modeling; both its 
main-beam and off-axis components. If the (m, v) coverage is in- 
adequate, for example due to a dearth of short interferometer 
spacings, this will introduce a limitation on the fidelity of the 
self-calibration model that is deduced, for example via spurious 
source components, which in turn will limit the precision of the 
solution. An interesting challenge to consider in this regard is 
the Sun, with its 10 6 Jy, 30 arcmin disk and intermittent active 
regions. If the far sidelobe antenna response does not provide 
sufficient suppression of this source, it will constitute a signif- 
icant modeling challenge in any self-calibration solution, as al- 
ready demonstrated in Figure [3] Of more general concern is the 
challenge of modeling sources within the main beam of an obser- 
vation that are only partially resolved by the synthesized beam. 
From Table [T]it is clear that the typical sources encountered ran- 
domly in a self-cal context are unlikely to be point-like in nature, 
but rather partially extended for baselines exceeding a few km. 
Such sources provide a particular modeling challenge, since in- 
trinsic source structure must be distinguished from the general 
image-plane smearing effects induced by imperfect calibration 
of the visibilities. An estimate of the maximum modeling error 
can be made from the Gaussian decomposition of the Cygnus 
A power spectrum shown in Figure Q] The fine-scale substruc- 
ture within this powerful radio galaxy accounts for between 10 
and 30% of the total flux. This suggests that a modeling error, 
£m > 0.1, might be the consequence of simply using a point-like 
model for such sources. 

We have attempted to better quantify source modeling er- 
rors by considering the quality of a CLEAN reconstruction of 
Cygnus A when it has been assigned an integrated flux of 100 
mJy and an angular size of 8 arcsec and is observed for 12 
hours with the VLA in the C configuration in a single spectral 
band at 1.4 GHz. The noiseless input and reconstructed visibil- 
ities for an on-axis source location are contrasted in Figure [6] 
Realistic thermal noise was included in the imaging and de- 
convolution process but no additional calibration errors, imply- 
ing that this is in many ways a best case scenario. Two ver- 
sions of the reconstructed visibility amplitudes are shown, one 
in which both positive and negative apparent brightness com- 
ponents found in a CLEAN deconvolution are retained and the 
other in which only positive brightness components were re- 
tained. The input and derived model visibilities have a mean 
difference of €m = 0.007 when all components are retained, 
but this increases to = 0.07 when only positive components 
are permitted. The positive source component constraint is often 
employed in practise as a means to discriminate against the in- 
clusion of instrumental calibration errors into the source model 
that is used for self-calibration. The large modeling errors which 
arise from this constraint are cause for substantial concern. The 
unphysical nature of the negative source model is apparent in the 
visibility amplitudes seen in the centre panel of Figure [6] which 
peak at a finite radius rather than at the origin. Despite the un- 
physical nature of such a source model, it provides a significantly 
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Fig. 6. Comparison of input (left) and reconstructed (centre, right) visibilities of a 100 mJy Cygnus A, when observed on-axis in 
the marginally resolved regime (8 arcsec source size and 12 arcsec beam) with negligible time and bandwidth smearing. The model 
visibilities in the right hand panel were constrained to only positive source components. The mean visibility error is 0.7% (centre) 
and 7% (right) relative to the integrated flux of the source. 




Fig. 7. Comparison of input (left) and reconstructed (centre, right) visibilities of a 100 mJy Cygnus A, when observed significantly 
off-axis in the marginally resolved regime (8 arcsec source size and 12 arcsec beam). The integration time and channel bandwidth 
were chosen to keep smearing below 10% of the synthesized beam at the edge of the main beam. The model visibilities in the right 
hand panel were constrained to only positive source components. The mean visibility modeling error is 0.9% (centre) and 10% 
(right). 



better match for the source visibilities at locations where they 
have been measured. 

When the same source is observed significantly off-axis, by 
15 arcmin (a half beamwidth) in both RA and Dec, using a 
channel bandwidth and integration time that keep smearing ef- 
fects in this configuration below 10% of the synthesized beam 
at the main beam edge: Av/v = 10~ 3 and r = 10 sec, the mod- 
eling errors increase as seen in Figure [7] The mean visibility 
errors increase to 0.9% and 9%, for the unconstrained and con- 
strained cases. The unconstrained modeling error increases with 
larger visibility smearing timescales, reaching 12% for integra- 
tion times of r ~ 160 sec (while Av/v is fixed at 10~ 3 ). When 
the positivity constraint is invoked, this effect dominates over 
smearing errors until r ~ 120 sec. The magnitude of the mod- 
eling error increases approximately linearly with the smearing 
scale as illustrated in Figure [8] Background sources dominated 
by a compact flat spectrum core would be more likely to be mod- 
eled accurately, at least when the dominant component is placed 
exactly on an image pixel. However, the vast majority of random 
field sources encountered at GHz frequencies are not of this type. 
The resulting visibility modulation is given approximately by, 



<T M = 0.1 € M S 



(39) 



where a linear scaling with self-cal solution timescales and fre- 
quency intervals (t$ , Avj ) is introduced relative to the nominal 



intervals (r*, Av*) that keep smearing effects below 10% of the 
synthesized beamwidth in eqns.[5]and[6] A typical attenuation of 
0.7 for the brightest field source on the main beam flank is also 
assumed. We will also consider the case where a high precision 
source model is available from other sources, in which case the 



residual modeling error, 
scale, 



r M ~ 0-7 e M S rms (flikf). 



need not scale with the smearing 



(40) 



This circumstance might arise, for example, if a previous high 
quality model of the field were available, or the field were fully 
sampled with the overlapping primary beams of a phased ar- 
ray feed and visibility smearing effects were explicitly included 
when undertaking data comparison during self-calibration. The 
number of independent time and frequency samples at this off- 
axis distance are given by, 



M 



T T B 



TM 



2J1.38X 10 4 ' 



and 



FM 



Av T B 
2vd ' 



(41) 



(42) 



Significan t research o n better source modeling methods (eg. 
lYatawattall2010l 20111) will be vital to allowing such high pre- 
cisions to be achieved. 
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Fig. 8. Mean source modeling error as function of the relative 
visibility smearing scale on the main beam flank. The reference 
smearing scale is taken to be 10% of the synthesized beam at 
half power of the main beam. The solid curve is a high preci- 
sion model that likely represents a best case scenario, while the 
dashed curve is a simple model that is constrained to have only 
positive components. 



2.5. Self-cal Parameters 

The signal-to-noise ratio obtained within a self-cal solution in- 
terval depends crucially on both the integral source flux density 
available and the number of major components over which it 
is distributed. The NVSS statistics of Table Q] and the Cygnus 
A power spectrum of Figure Q] were used to calculate the inte- 
grated flux density per unit area together with the sum of flux- 
weighted component number shown in Figure [9] The solid line 
in the figure gives the total flux per square degree due to sources 
between 3 and 300 mJy at 1 .4 GHz (for which the likelihood of 
occurrence in a one degree field is at least unity) while the dark 
dashed line includes brighter sources as well. Contributions to 
the integral are quite evenly split between the 10, 30 and 100 
mJy bins, while contributions to component number are domi- 
nated by the faintest flux bins. As noted previously, source pop- 
ulations are likely to be dominated by luminous radio galaxies 
(like Cygnus A) above about 30 mJy, while edge-darkened radio 
galaxies are more likely to dominate below this flux, implying 
that some modification of our power spectrum template might 
be appropriate for the faintest flux bins. A representative expres- 
sion for the integrated flux density per unit area is, 



S Tot - 920 D.deg 

= 920 a,,. 



1.4 GHz 

v 



-0.8 



(B < Br) 



(43) 



-0.8, ,-0.75 

— (B> B R ) mJy deg~ 
1.4 GHz/ \B R V id 



where the total brightness scales roughly as 



-0.8 



and the 



baseline length is normalised to the break at about Br = 
10(v/1.4 GHz) -1 km, below which the total flux saturates. The 



corresponding expression for flux-weighted major component 
number is, 



N c = 66 n deg (B < B R ) 



(44) 



B 



0.75 



{B>B R ) deg 2 . 



Other important quantities needed to assess the magnitude 
of many contributions to the visibility noise budget are S mox(Q), 
the maximum source brightness that can be expected by chance 
in a field of a given solid angle together with its overall angular 
size, i/> in arcsec. The NVSS source counts of Table Q] can be 
approximated very well with the arbitrary analytic form, 



log(/V(>S)) = 2.24- 



[log(Sl.4Gff;) + 5] 

13.9 



2.4 



(45) 



which is overlaid on the count data in Figure [10] and agrees 
within 0.02 dex over the range 0.003 < S i .4 < 3. Jy. The median 
source size, iff, of Table Q] is also plotted in Figure [10] together 
with an analytic approximation, 



logtyr) = 0.31 + 0.27 [log(S lAGHz ) + 3] 

+ 0.74exp{-2[log(5 L 4G//z)- 1-3] 2 } 



(46) 



that agrees with the data to within 0.05 dex. Equation|45]can be 
inverted to yield the flux of the brightest 1 .4 GHz source occur- 
ring in a solid angle Cl^g, as, 

Siamux = 1.4-antilog 10 ({13.9[2.24-log(l/Q rfeg )]} a417 -5)Jy, (47) 

where the leading factor of 1 .4 accounts for the median flux of 
sources within the 0.5 dex bins used in the source counts. This 
can be scaled to other observing frequencies and baseline lengths 
via, 

-0.8 



» Max 



1 AMax 



1 AMax 



1.4 GHz 



1.4 GHz 



(B < B^) 



(48) 



-0.8 



-0.75 



(B > B$) Jy, 



where the characteristic baseline By, that resolves the source is 
given by, 



Bj, = 5 



10 arcsec 



1.4 GHz 



km. 



(49) 



2.6. External Calibration 



There are instances where self-calibration may not be practi- 
cal and others where it may not even be necessary. It is useful 
to define the base noise level due to main beam sources under 
these circumstances. For a gain error of magnitude, <pc, with a 
correlation timescale, tc and fractional correlation bandwidth 
(Av/v) c = 6c, we can write the visibility noise associated with 
a source in the main beam as, 



0~Cal 



l>CJ RMS 



("if), 



(50) 



together with the number of independent time and frequency 
samples in a full track observation as, 



M TC 
and 

M FC - 



Tt/t c , 



v6 c 



(51) 



(52) 
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Fig. 9. The mean integrated flux density per unit area (left) due to the extragalactic sky and the corresponding number density of 
flux-weighted source components (right) as function of angular scale. The total is given by the solid curve, while individual bins 
from Table 1 are plotted as short-dashed curves. 




Fig. 10. The N VSS 1 .4 GHz source counts (left) per 0.5 dex bin are shown as the filled points connected by a solid line. The analytic 
approximation over the range 0.003 < S 1.4 < 3. Jy that is given by eqn.|45]is overlaid as the dashed curve. The median source size 
from TableQ]overlaid with an analytic approximation is shown in the right hand panel. 



In the event that the main beam field of view approaches 200 
deg 2 , the large field version of the visibility fluctuations, S' RMS , 
must be considered yielding, 



^Cai ~ *PcS RMS (Q*). (53) 



3. Some Practical Examples 

With all of the quantities and scaling relationships defined in §|2] 
it becomes possible to characterise the self-cal performance and 
potential limitations of specific antenna systems. Although the 
principles we outline are completely general, the source statistics 
and properties we model are based on data obtained near 1.4 
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GHz and will be most relevant to frequencies between about 100 
MHz and 5 GHz. Outside of this range, there may be sig nificant 
differences in the so urce population (eg. lScaife & Hea ld (2012) 
iMassardi et all (20110). 

In the subsections that follow, the attributes of several exist- 
ing and planned arrays listed in Table|2]will be considered, both 
in terms of visibility noise contributions at the basic self-cal so- 
lution interval, as well as the final image noise contributions in 
an observation of a random field that is tracked for tj — 12 hours 
in the case of a dish array or tj = 4 hours for the case of aperture 
arrays. For the full-track case we will consider both a standard 
continuum mode observation, in which we assume that a total 
fractional bandwidth of Avt/v = 0.1 is ultimately averaged to- 
gether as well as a standard spectral line mode observation in 
which a spectral resolution of Av^/v = 10~ 4 is utilised in the 
image. For the self-cal solution interval, it will be important to 
achieve a signal-to-noise ratio that is sufficient to allow conver- 
gence to be achieved. We will define this by requiring a residual 
calibration error, as given by eqn. [2] of <f>(js) < 0.5 rad. This 
yields a visibility solution noise limit, 



a-soi < 0.5 S Tot (N-3f 5 /N° c 5 . 



(54) 



For the full-track continuum imaging it will be of relevance to 
compare the final image noi se contributions with the source con- 
fusion level. ICondon et al. (2012) have recently estimated the 
magnitude of such confusion due to the faint source population 
to be, 



o- C f„ ~ 1.2/Jy 



3.02 GHz 



-0.7 



10/3 



(55) 



For the purposes of illustration we will consider several exam- 
ples of main beam source modeling errors, one in which signif- 
icant effort has been expended to achieve a nominal precision 
€m = 0.01, a cruder version e' M - 0.1 and finally a high pre- 
cision variant, = 0.001. To simplify the interpretation, we 
will not apply additional scaling of these values with the smear- 
ing factors of eqn. [39] Recall from Sec. l2.4l that a 1% precision 
was found challenging to achieve for marginally resolved back- 
ground sources with a realistic degree of complexity and that 
a 10% error was often encountered in practise with simplistic 
modeling approaches. Achieving 0.1% modeling precision rep- 
resents a major technical challenge that would rely both on more 
extensive information of the field under study than is typically 
available as well as the algorithm complexity to utilise that in- 
formation effectively. The full track cases will also be compared 
with the baseline noise level due to gain calibration errors of 
main beam sources to illustrate the consequence of not apply- 
ing self-calibration at all. A nominal gain error of <pc = 0.2, 
corresponding to 10 degrees of phase, a correlation timescale 
of Tc = 15 minutes and a correlation fractional bandwidth 
of 5c = 0.1 will be assumed throughout for illustration pur- 
poses. All of the relevant parameters are collected and defined in 
Tabled 

Of further interest is an answer to the question: "What is 
the ultimate limit to image noise contributions that follows from 
multiple long duration tracks of the same field?". The answer 
follows from considering which of the error terms are correlated 
over multiple observing tracks and which are not. The thermal 
contribution cry, will clearly be uncorrected from one track to 
the next. For each of the other contributions we have considered, 
this is not guaranteed. If the same model is being employed to 
determine the self-calibration variables for all data tracks, then 
residual errors due to model imperfections will be identical at 



the corresponding hour angles of subsequent tracks and the cr M 
component will not average down. In the case of pointing errors, 
<Xp, which have not been modeled, it will depend upon the nature 
of these errors. If they are the result of an incomplete pointing 
model or diurnal temperature variations they may well approx- 
imately repeat on subsequent days. If instead they are due to 
more random variations such as wind buffeting, they will aver- 
age down. The contributions due to main beam frequency mod- 
ulation, <x B , as well as unmodeled sources both within near-in, 
as , and far sidelobes, o~n, are intrinsic to the sky-telescope sys- 
tem and as such will repeat exactly for multiple tracks and will 
not average away. An exception to this is the case of far side- 
lobe pickup of the Sun, cr D , which will have a different relative 
location and hence different visibility response on subsequent 
observing dates. The nominal initial calibration error, <pc, is not 
likely to be correlated between observing sessions if it is domi- 
nated by ionspheric or tropospheric phase fluctuations. 

3.1. TheJVLA 

Let's first consider the JVLA, with its N -27 (alt, az) mounted 
dishes of d — 25 m. The logarithmic three-arm "Y" of the JVLA 
has a high central concentrat ion of bas elines in each of its con- 
figurations (see Fig. 25-4 of Hjellming 1989) so that more than 
half of all baselines are within Bm«/ = Bmo.v/6. The four standard 
configurations, designated (A, B, C, D) are defined by Bmox - 
(36.4, 11.1, 3.4, 1.03) km. The recently upgraded Cassegrain re- 
ceivers provide continuous frequency coverage with a series of 
approximately octave band feeds between about 1 and 50 GHz 
with a system equivalent flux density, 



SEFD 



(Ma) 



(56) 



in terms of the Boltzman constant, kg, the system temperature, 
Tsy S , the physical area of a dish, A, and the aperture efficiency, 
Tj a- Online JVLA documentation suggests, 



SEFD* 250 + 3.4 



1 GHz 



9 Jy, 



(57) 



that applies to 1 < v < 15 GHz, yielding a visibility thermal 
noise per polarisation product, 



(Tj — 



SEFD 



(tAv) 



0.5 



Jy. 



(58) 



Pointing errors of the VLA 25m dishes have an RMS magnitude 
of about P — 10 arcsec under good nighttime conditions, that 
is likely correlated over timescales of about t> = 15 minutes. 
The pointing errors have a larger amplitude under daytime and 
high wind conditions, sometimes exceeding an arcminute, but 
can also be very effectively reduced under good conditions by a 
reference pointing strategy to better than about 5 arcsec. Beam 
squint of the JVLA m ain beam has been accurately measured 
(lUson & Cotton! 2008D and would contribute e G = 0.055 if not 
modeled in an observation. The magnitude of main beam mod- 
ulation with frequency of the VLA has not been documented, 
but the similarity in feed geometry to that of the ATCA suggests 
that a comparable value of e# = 0.05 will likely apply when this 
modulation is not explicitly accounted for in the self-calibration 
process. The primary-subreflector separation is, Iq = 8.2 m. 
The near-in sidelobe level of the main beam is about 2%, so 
es = 0.02, if such sources, including their apparent time vari- 
ability are not accounted for in the self-cal model of the field. In 
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Table 2. Measured and Assumed Telescope Parameters. 



Telescope 


N 


d 

(m) 


Bmux 

(km) 


(km) 


If 


ft 


p 

(arcsec) 


TP 

(min) 


4 


(min) 


e Q 




k 

(m) 


JVLA-D Config. 


27 


25 


1.0 


0.17 


0.1 


0.02 


10 


15 






0.055 


0.05 


8.2 


JVLA-C Config. 


27 


25 


3.4 


0.57 


0.1 


0.02 


10 


15 






0.055 


0.05 


8.2 


JVLA-B Config. 


27 


25 


11.1 


1.85 


0.1 


0.02 


10 


15 






0.055 


0.05 


8.2 


JVLA-A Config. 


27 


25 


36.4 


6.07 


0.1 


0.02 


10 


15 






0.055 


0.05 


8.2 


ATA 


42 


6.1 


0.32 


0.08 


0.7 


0.01 


90 


15 






0.04 


0.01 


3.0 


ASKAP 


36 


12 


6.0 


0.63 


0.2/2" 


0.02/10° 


10 


15 


0.01 


1 


0.04/100° b 


0.05/10° 


6.0 


MeerKAT 


64 


13.5 


8.0 


0.50 


0.2 


0.01 


10 


15 






0.04 


0.01 


7.0 


SKAl-Survey 


96 


15 


20 


1.0 


0.2/2° 


0.01 


10 


15 


0.01 


1 


0.04/10" 


0.01/10° 


7.0 


SKAl-Dish 


250 


15 


20x5 r 


1.0 


0.2 


0.01 


10 


15 






0.04 


0.01 


7.0 


LOFAR HBA Core 


48 


30.8 


3.5 


0.25 


0.5 


0.1 






0.01 


1 


0.01 






LOFAR HBA Ext 


64 


30.8 


121. 


1.0 


0.5 


0.1 






0.01 


1 


0.01 






MWA 


128 


4.4 


3.0 


0.3 


0.5 


0.1 






0.01 


1 


0.1 






SKAl-LowCore 


35 


180 


5.0 


0.5 


0.5 


0.1 






0.01 


1 


0.01 






SKAl-Low Ext 


50 


180 


100. 


2.5 


0.5 


0.1 






0.01 


1 


0.01 







Notes. (fl) Phased array feeds provide improved aperture illumination and reduced beam modulation. (o) A parallactic angle mount provides 
stationary near-in sidelobes and main beam. w Analysis is presented for B Ma x = 20 km, while the array is projected to extend to B Ma .\ = 100 km. 



view of the significant azimuthal variation in the sidelobe prop- 
erties, and their rotation with parallactic angle, accounting for 
these effects ad ds substantial complexity to the processing (eg. 
Bhatnagar et al. 2008) The far sidelobe contributions under day- 
time, cr D , and nighttime, cr N , conditions follow directly from 
the dish diame ter and eqn. [U which has been calibrated with 
i]p = 0.1 from [Dhawanl ( 2002) and eqns. [T4l and [T3l The mag- 
nitude of the residual source modeling errors will depend on the 
methods and effort that have been employed to minimise these 
effects. 



The noise budget for the JVLA in its D configuration is 
shown in Fig.QT| On the solution timescale appropriate to keep 
smearing effects below 10% of the synthesized beam width the 
thermal noise, cr T , is below that which is required for self-cal 
convergence <x So /. This implies that full-field self-cal can be per- 
formed with minimal smearing effects. As will become apparent 
below, this condition is often not met in practise by telescope 
systems, so that more extensive data averaging becomes neces- 
sary. The thermal noise is seen to dominate over all noise contri- 
butions except for daytime observations below about 1 .4 GHz, 
where visibility modulation by the Sun becomes comparable. A 
full track continuum observation is dominated by source confu- 
sion noise, o~cfn, below about 1.5 GHz. However, the calibra- 
tion noise, cr Ca ;, vastly exceeds the thermal noise and becomes 
dominant overall at higher frequencies, implying a clear need for 
self-calibration. The next largest contributions come from source 
modeling errors (in the case, a J M , where only 10% precision is 
achieved), followed by beam squint <xg, near-in sidelobe <ts and 
main beam modulation cr B . Sky-limited continuum performance 
above about 2 GHz will rely on achieving a high source mod- 
eling precision and explicit beam squint compensation. In the 
case of a full track spectral line obervation, the thermal noise 
is seen to dominate above 2-3 GHz, although self-calibration 
will be necessary at lower frequencies. Solar modulations will 
have a negative impact on daytime spectral line performance at 
frequencies below about 1.5 GHz. Ultra-deep spectral line ob- 
servations will ultimately become limited by the stationary error 
contributions as noted above. The most important of these are 
<t' m , <ts, cr B and ctq. Even so, it should be possible to improve 
upon the single track sensitivity by about a factor of several in 
long integrations, provided the source modeling precision is bet- 



ter than about 3% and most importantly, that radio frequency 
interference is negligible. 

The same type of analysis applied to the JVLA-C configu- 
ration is given in Fig. [T2] Here we see that while the thermal 
noise dominates over other contributions on the desired solution 
timescale, it exceeds what is required for self-cal convergence 
osoi, so that more extensive data averaging, by about a factor of 
two in both domains, becomes necessary. This will imply more 
extensive time and bandwidth smearing for off-axis sources, 
which may limit the source modeling precision which can rou- 
tinely be acheived. The full track continuum observation will 
require self-calibration to reach the thermal noise. Substantial 
effort needs to be expended on achieving a source modeling pre- 
cision approaching 1%, otherwise this contribution will domi- 
nate the noise budget below 3 GHz. The next most serious ef- 
fect is that of beam squint. As noted previously, active model- 
ing of this effect has proven effective at improving image dy- 
namic range at the expense of computational complexity (Uson 
& Cotton 2008) By addressing both of these issues satisfactorily 
it should become possible to reach the thermal noise. Reaching 
the source confusion level o~cfn in deep integrations will be con- 
siderably more challenging, since several additional error con- 
tributions would need to be overcome. The full track spectral 
line observation is thermal noise limited above 2-3 GHz, while 
self-calibration with at least moderate source modeling precision 
(< 10%) will be required at lower frequencies. Even deep inte- 
grations should still be in this regime, provided that RFI remains 
negligible. 

Very similar patterns are seen in the more extended B and A 
configurations as shown in Figs.ll3land[T4l The more extended 
configurations require smaller time and frequency averaging in- 
tervals to support the iterative full-field source modeling inher- 
ent to self-calibration, but the finite sensitivity per visibility does 
not allow this to be achieved while providing enough signal-to- 
noise. Excess averaging in both domains by about a factor of five 
(and ten) are required for the B (and A) configuration(s). This 
will severely impair the ability to achieve high levels of mod- 
eling precision on the main beam flanks for the simplest imag- 
ing algorithms. More complex wide-field imaging and modeling 
strategies will be mandatory under these circumstances, such as 
employment of multiple distributed phase centres. While such 
methods are well-understood, they impose a significant compu- 
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Table 3. Parameter Definitions. 



Variable Denning Equations Comments 



T* 




1 




Nominal self-cal solution timescale 


Ay* 




8 




Nominal self-cal solution bandwidth 






2 




Self-cal residual error on solution timescale 


C Map 




3 




Image plane natural noise 






20 




Main beam solid angle 


£h 




21 




Near-in sidelobe solid angle 






43 




Integral source brightness in solid angle 


N c (£i) 




44 




Flux-weighted number of source components in solid angle 


S RMS (SI) 




18 




RMS source brightness in solid angle 






19 




Large field RMS source brightness in solid angle 






48 




Maximum source brightness in solid angle 


a- sol 




54 




Self-cal solution noise required for convergence 


CTCfn 


m 


Source confusion noise 


0~Cal 


Ho] 


Gain calibration noise 


a Cal 


m 


Large field gain calibration noise 


SEFD 




56 




System equivalent flux density 


a-j 




58 




Thermal noise 


0C 


150J 


Main beam gain calibration error 


e F 








Far sidelobe attenuation relative to on-axis 


£s 


El 


Near-in sidelobe attenuation relative to on-axis 


<> 




28 




Electronic pointing error 


tQ 




32 




Main beam shape asymmetry 


e B 


eh 


Main beam shape modulation with frequency 


£m 




39 




Source modeling error 


cr N 


[131 


Nighttime far sidelobe visibility modulation 


VD 


m 


Daytime far sidelobe visibility modulation 


OS 


El 


Near-in sidelobe visibility modulation 


a-p 




25 




Main beam mechanical pointing visibility modulation 


<r' p 




28 




Main beam electronic pointing visibility modulation 


O-Q 




32 




Main beam asymmetry visibility modulation 


OS 




Main beam modulation visibility modulation 


0~M 


1391401 


Source modeling error visibility modulation 


P 




25 




Mechanical pointing precision 


Tp 




26 




Timescale for mechanical pointing modulation 


r'r 




29 




Timescale for electronic pointing modulation 


T C 




51 




Timescale for calibration error 


Sc 




52 




Fractional bandwidth for calibration error 


M T f 




15 




Far sidelobe independent time solutions 


M FF 




16 




Far sidelobe independent frequency solutions 


M TS 




22 




Near-in sidelobe independent time solutions 


M FS 


|23|24| 


Near-in sidelobe independent frequency solutions 


M TP 




26 




Main beam mechanical pointing independent time solutions 


M FP 




Main beam mechanical pointing independent frequency solutions 


M' TP 




29 




Main beam electronic pointing independent time solutions 


m 'fp 




Main beam electronic pointing independent frequency solutions 


M t q 


m 


Main beam asymmetry independent time solutions 


M F Q 


M 


Main beam asymmetry independent frequency solutions 


M TB 




36 




Main beam modulation independent time solutions 


M FB 


|37|38| 


Main beam modulation independent frequency solutions 


M TM 




41 




Source modeling error independent time solutions 


M FM 




42 




Source modeling error independent frequency solutions 


M TC 


EB 


Calibration error independent time solutions 


M FC 




52 




Calibration error independent frequency solutions 



tational overhead. The full track continuum performance relies 
on self-calibration and beam squint effects will also need to be 
addressed to reach the thermal noise. Source modeling preci- 
sion better than about 1 % will be required, together with explicit 
modeling of the full range of main beam and near-in sidelobe 
effects (ctq, crp, <tb and cr s ) if deep integrations are to remain in 
that regime. The challenges in all these areas are greatest at the 
lowest frequencies and for the less extended configuration. 



3.2. ATA 

The A llen Telescope Array dWelch et al.l (2009) Harp et al.l 
(2011)) consists of = 42 off-axis (alt,az) mounted dishes 
of d = 6.1 m. The ATA configuration is very compact with 
Bmox = 323 m and BMed = 81 m. The dishes are fed by an ultra- 
wideband log-periodic feed from a Gregorian focus over the fre- 
quency range 0.5 to 10 GHz. A model for the system temperature 
performance of the system as function of frequency presented by 
IWelch et al.l (20091) provides, 

T Sys = 19.7 + 4v G ' 5 + 9.5v£ 5 + 0.8v G + 3v c 2/7 K, (59) 
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Fig. 11. The noise budget for the JVLA-D configuration for a single solution interval (left) for a full track continuum observation 
(centre) and for a full track spectral line observation (right). Individual noise contributions are as defined in the text. 
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Fig. 12. The noise budget for the JVLA-C configuration as in Fig.QjJ 
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Fig. 13. The noise budget for the JVLA-B configuration as in Fig.QT] 
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Frequency (MHz) 



in terms of the frequency in GHz, vg, at an assumed aperture ef- 
ficiency, t]a = 0.6. Typical values of receiver temp erature and il- 
lumination efficiency tabulated by (Harp et al. 201 1 ) suggest sig- 
nificantly higher values of Ts ys /t]a between about 70 and 270 K. 
The values of Ts ys given by eqn. [59] range between about 37 
and 60 K, with corresponding SEFDs between 5800 and 9400 
Jy. The pointing accuracy of the dishes under good nighttime 
conditions is P — 90 arcsec with a likely correlation timescale 



Tp — 15 m inutes. The beam squint for this offset design has been 
measured (lHarpet all 201 II) to be about 4% FWHM at 1.4 GHz, 
yielding eg = 0.04. Major asymmetries in the main beam shape, 
at the 20% level, are also documented graphically by Har p et al.l 
(201 1) , suggesting that a beam squash contribution to eg may be 
significantly larger. The peak near-in sidelobe levels measured 
at 2300 MHz are about 1%, yielding e s = 0.01, with substan- 
tial azimuthal variation in sidelobe intensity. While beam and 
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Fig. 14. The noise budget for the JVLA-A configuration as in Fig.fTTI 



sidelobe modulation with frequency have not been documented 
directly, they might be expected to track the gain modulation 
with frequency intrinsic to the log-periodic feed, which has an 
RMS fluctuation level of about 1 dB, or e' B ~ 0.25 and corre- 
lation bandwidth, Av/v =s 0.03. This particular gain modulation 
is due to the quantised, resonant element wavefront sampling 
of the wide-band feed and is in addition to any multi-path in- 
duced gain modulation with frequency as discussed previously. 
Multi-path beam modulation should have a much lower ampli- 
tude, of perhaps eg ~ 0.01 (twice the GBT value). Since the 
feed is mounted on the edge of the primary reflector surface, the 
primary-subreflector separation of about Iq ~ 3 m is likely the 
most relevant cavity dimension for an estimate of modulation pe- 
riodicity. For the purpose of our analysis, we will co nsider both 
of these possibilities for main beam modulation e#. lHarp et a"D 
(2011) have also carried out far sidelobe measurements of the 
ATA dishes at 2300 MHz which are in the range of -30 to -40 dB 
over much of the sky. This implies a value of rjf - 0.7 for use 
in eqn.|4] which is significantly higher than that which applies to 
the upgraded JVLA. 

The noise budget analysis for the ATA telescope is shown 
in Fig. Q3] What is striking is that visibility fluctuation levels 
are predicted to be completely dominated by the far sidelobe 
response on the self-cal solution timescale. During daytime ob- 
serving this is true up to 3 GHz, while even for nighttime ob- 
serving this will be the case below 1.4 GHz. This is a conse- 
quence of both the small dish size and the very compact array 
configuration. Substantial visibility averaging (by about a factor 
of five in both domains) would diminish these contributions to 
the point that the self-cal signal-to-noise criterion could be met. 
A full track continuum observation would be source confusion 
noise limited, but many of the noise contributions under con- 
sideration would exceed the thermal noise below about 1 GHz. 
Above 1 GHz, the potential main beam modulation associated 
with gain variations of the wide-band log-periodic feed becomes 
the most important contributor for nighttime observations. The 
full track spectral line case is dominated by far sidelobe effects 
below a few GHz. The nominal calibration noise exceeds the 
thermal noise below about 1 GHz, implying that self-cal may not 
be required for spectral line applications at higher frequencies. 
In that case, it may be possible to simply ignore the far sidelobe 
response contributions, since the visibility gridding function will 
provide some 40dB of suppression for such distant sources. Any 
sidelobes of the instantaneous synthesized beam response will of 
course not be suppressed during imaging which may become a 



limitation in this case. Deep spectral line integrations would en- 
counter a range of issues that would necessitate extensive mod- 
eling and self-calibration to overcome. 

3.3. ASKAP 

Let us now consider what challenges will be faced with the 
ASKAP array, with its N = 36, three axis {alt, az, pol) mounted 
dishes of d — 12 m. The ASKAP configuration is highly cen- 
trally concentrated with Bmqx - 6 km and BMed - 630 m. The 
dishes are fed with a prime focus PAF (phased array feed) that 
covers the frequency range 700 to 1800 MHz. Current perfor- 
mance estimates are for Ts ys /T]A = 55 K across this band, yield- 
ing an SEFD of 1340 Jy for the 12m dishes. 

Mechanical pointing errors of the ASKAP dishes under nor- 
mal operating conditions have not yet been determined, but will 
be assumed to be about P = 10 arcsec under good nighttime con- 
ditions and correlated over timescales of about t> = 15 minutes. 
What must also be assessed is the electronic pointing error that 
might be associated with gain fluctuations of the individual an- 
tenna elements that are combined to produce the main beam in a 
PAF. While actual values have not yet been determined, current 
estimates of the element gain variability are better than e' p = 0.01 
with variability timescales of t' p — 1 minute. 

The main beam modulation phenomenon that afflicts single- 
pixel-fed radio telescopes has been demonstrate d to be dimin- 
ished by between 20 and 30 dB in a PAF system (Oosterlo o et al.l 
20101) employing Vivaldi antenna elements. The reason for this 
improvement appears to be related to a much better and phys- 
ically more extended power matching to the incident radiation 
field, such that reflectivity from the feed is minimised. While 
similar measurements are still required for the chequerboard an- 
tenna array of the ASKAP phased array feeds, it is expected that 
they will have similar performance in this regard. We will as- 
sume a conservative 10 dB reduction in the frequency modula- 
tion, to e B ~ 0.005. 

The phased array feed also provides the potential to strongly 
suppress beam squash through optimisation of the individual po- 
larisation beams and their relative alignment. Furthermore, as 
noted previously, the ability to track parallactic angle with the 
"polarisation" axis allows any residual beam squash pattern to 
be fixed on the sky during tracking so that the apparent bright- 
ness of sources on the beam flank remains unchanged. For il- 
lustration purposes we will assume a 10 dB reduction due to 
each of these contributions to beam squash from a base level 
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that would apply to an (alt, az) mounted single pixel feed of 4%, 
giving e Q = 0.0004. 

The near-in sidelobe pattern of a phased-array feed illumi- 
nated reflector is expected to have a comparable near-in side- 
lobe level to that of the single-pixel-fed equivalent (Oosterloo 
et al. 2010) so is likely to have a peak amplitude of some 2%. 
However, unlike the single-pixel-fed counterpart, this pattern is 
not expected to display an amplitude modulation with frequency. 
As noted above for the main beam modulation, the phased array 
feed illumination pattern is expected to be very similar over fre- 
quency rather than modulated at the 5% level. Moreover, the po- 
larisation rotation axis of the ASKAP telescope mount will per- 
mit all of the telescope sidelobes, including the radial spokes as- 
sociated with the prime focus support struts, to remain stationary 
on the sky during source tracking. The time and frequency sta- 
bility of the near-in sidelobe response would enable straightfor- 
ward modeling of sources at these large offsets. Furthermore, it 
is planned that all 36 digital main beams formed by the PAF will 
be processed simultaneously via a joint self-cal source model for 
the entire field of view. The implication is that source informa- 
tion for modeling the field of each main beam and its near-in 
sidelobes is provided by a beam in which that source is essen- 
tially on-axis, with minimal smearing effects or main beam at- 
tenuation. Provided that the relevant time and frequency smear- 
ing effects for each beam are explicitly included during data 
comparison, it should permit very low values of both e$ and 
to be obtained. For illustration purposes, we will assume only a 
conservative 10 dB reduction in this component to ej = 0.002. 

For the far sidelobe contributions under daytime, cr D , and 
nighttime, cr N , conditions we will assume rjf = 0.1 since we 
expect excellent illumination quality in this prime focus system, 
comparable to what currently applies to the Cassegrain JVLA 
and the prime focus WSRT together with eqns. |4] [14] and [13] 
This represents an improvement by perhaps a factor of 2 relative 
to a single pixel fed dish of this size. 

The noise budget analysis is presented in Fig. [16] Thermal 
noise is seen to dominate on the self-cal solution interval, al- 
though the signal-to-noise requirement implies that additional 
data averaging (by about a factor of five in both domains) is 
needed. The full track continuum case demonstrates that self- 
calibration will be essential and that source modeling errors will 
be the single most important factor to address for this array de- 
sign. For single track observations, these must be kept below 
about 1 % to achieve thermal noise limited performance, but for 
deep integrations above 1 GHz, it will be necessary to achieve 



even better precision in order to reach the source confusion level. 
As noted above, the simultaneous imaging of the multi-beam 
field-of-view offers good prospects for achieving high source 
modeling precision. It is also of interest to compare the contribu- 
tions of electronic, <r' p , and mechanical pointing, <x P , to the noise 
budget in this case. While these contributions have different fre- 
quency scalings, they have a similar magnitude for the parame- 
ters assumed here. It is also worth noting that the electronic com- 
ponent is not expected to be correlated over multiple observing 
sessions, so will average down further in a deep observation. The 
full track spectral line case should permit routine thermal noise 
limited performance to be achieved, although self-calibration is 
required across the band. Deep spectral line integrations will be- 
come dependent on the quality of source modeling. 

3.4. MeerKAT 

MeerKAT is currently planned to consi st of N = 64 off-axi s 
(alt,az) mounted dishes of d — 13.5 m dBooth & Jo nas 2012) 
A single Gregorian focus feed covering about 1 to 1 .74 GHz is 
planned for the first phase of deployment with a specified total 
sensitivity of about 220 m 2 /K, corresponding to T^/tja ~ 42 K 
and an SEFD of 810 Jy. While the telescopes have not yet been 
fabricated, we will assume that relatively good pointing accuracy 
of about P k 10 arcsec may be realised under good conditions 
with a correlation timescale t> * 15 minutes. Beam squint for 
the MeerKAT dishes is expected to be somewhat larger than that 
of the GBT, given the smaller size of the subreflector of about 
4 versus 7.5 m, so may be about e@ « 0.01 in practise. Beam 
squash simulations are not available, so we will assume that 
values comparable to those achieved at the GBT will apply of 
€q sa 0.04. Peak near-in sidelobe levels will depend on the degree 
of edge illumination, but might be about e$ ~ 0.01. Similarly, 
the multi-path induced beam modulation effects might also be 
slightly larger than those of the GBT, es ~ 0.01, while the domi- 
nant resonant "cavity" dimension in the system will probably be 
the primary-subreflector separation of about Iq ~ 7 m. The re- 
cent study of lCortes & Imbriala (2009) provides an indication of 
far sidelobe levels from similar optical designs and dish dimen- 
sions of about -40 dB at 1.4 G Hz. This would su ggest rjf « 0.5 
for use in eqn.|4] However, the ICortes & Im briale (2009) study 
used the non-optimum illumination of an ultra-wideband feed 
and is strongly influenced by subreflector spillover. An octave 
band Gregorian feed design is more likely to achieve superior 
illumination quality of perhaps rjf « 0.2. The MeerKAT array 
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Fig. 17. The noise budget for the MeerKAT telescope as in Fig.fTTI 



configuration will have Bmox = 8 km, but be highly centrally 
concentrated with B Me d ~ 500 m. 

Inspection of the noise budget analysis in Fig.fTTIshows that 
more substantial data averaging (by about a factor of four) is 
required to meet the crsoi criterion in the self-cal solution inter- 
val. For the full track continuum case, self-calibration is clearly 
required and the subsequent noise budget is completely domi- 
nated by source modeling errors. Reaching the thermal noise in 
a single track will require achieving better than about 1 % mod- 
eling precision, which will be challenging. Beam squash effects 
constitute the next challenge, with these contributions exceed- 
ing the thermal noise by factors of several. Explicit modeling 
of the time variable response in the elliptical polarisation beams 
will be essential to achieving thermal noise limited performance. 
Comparison with the JVLA plots for similar configurations in 
Figs.[l2]and[T3] suggests that it will be about twice as challeng- 
ing for MeerKAT to achieve thermal noise limited performance 
as the JVLA. 

Achieving even lower noise levels in deep integrations will 
require a significant further reduction in the source modeling 
and beam squash errors, together with explicit modeling of ad- 
ditional main beam and near-in sidelobe issues (cr P , cr B and cr s ). 
The full track spectral line analysis suggests that thermal noise 
limited performance will be straightforward to achieve provided 
self-calibration is employed, with deep integrations becoming 
sensitive to the precision of source modeling. 



3.5. SKA 1 -Mid 

The Square Kilometre Array Organisation has recently an- 
nounced the intention that the SKA Phase 1 mid-frequency dish 
array be deployed as two complimentary components: an SKA1- 
Survey instrument to be sited in Australia and an SKA 1 -Dish 
array to be sited in South Africa. 

3.5.1. SKA1 -Survey 

The SKA 1 -Survey instrument is projected to consist of = 96 
dishes of d = 15 m covering the frequency band of 0.45 - 
3.0 GHz with three overlapping phased array feed systems, 
each of octave bandwidth. While the array configuration has 
not been determined in detail, it might be expected to have 
Bmux - 20 km and BMed = 1 km. The sensitivity specification 
calls for Tsvs/r/A = 50 K above about 1 GHz, accepting the fact 
that the Galactic contribution will become significant at lower 
frequencies. An estimate of the frequency dependence of system 
performance is given by, 

T Sy , * T Nom + 60^; 55 K, (60) 

in terms of a nominal temperature, T^om ( tnat excludes the 
Galactic contribution) and the wavelength, A m in meters. 
Relevant values for the PAF systems might be T^ om — 37 K and 
t\a = 0.8. An antenna and mount design have not yet been deter- 
mined for the SKA, but for the present purpose we will assume 
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that an off-axis Gregorian-fed system with an (alt,az) mount 
is utilized. We will assume good mechanical pointing accuracy 
of about P x 10 arcsec with a correlation timescale rp » 15 
minutes. Electronic pointing errors of e' p = 0.01 correlated over 
t' p — 1 minute will also be assumed. In the absence of a polar- 
isation mount axis, it will be necessary to track the digital PAF 
beams which are offset from boresight across the focal plane. 
The substantial intrinsic beam squash of about 4% is likely to 
be improved by about 10 dB with polarisation specific PAF il- 
lumination to about €q = 0.004. The multi-path induced beam 
modulation will be decreased from its nominal 1 % level in such 
an off-axis design by at least 10 dB to €b ~ 0.001. Near-in side- 
lobes es ~ 0.01 will be assumed in this offset design. Good qual- 
ity PAF illumination should provide an improvement of about a 
factor of two in the far sidelobe efficiency to rjf — 0.1. 

The self-cal solution noise budget in Fig. 1181 demonstrates 
yet again that there is insufficient sensitivity for self-calibration 
at the "natural" time and frequency interval for straightforward, 
full beam imaging, necessitating some further data averaging. 
The full track continuum case will require self-calibration and 
significant attention to source modeling, particularly at the low- 
est frequencies where a precision of < 1 % will be required, with 
the near-in sidelobes representing the next largest error contribu- 
tion. It is the absence of a polarisation axis which introduces time 
variability into the near-in sidelobe response and adds this com- 
plexity to the imaging problem. Obtaining very deep integrations 
will rely on further precision in modeling sources both in the 
time variable near-in sidelobes as well as in the main beam. Full 
track spectral line observations are expected to be much more 
straightforward, although they will require self-calibration. Deep 
integations will become limited by source modeling precision. 

3.5.2. SKA1-Dish 

The SKAl-Dish array is projected to consist of N = 250 dishes 
of d = 15 m covering the frequency band of 0.45 - 3.0 GHz 
with three overlapping single pixel feed systems, each of octave 
bandwidth. The array configuration is expected to extend out to 
Bmqx =100 km but maintain a BMed ~ 1 km. However, to sim- 
plify comparison of the results with both SKA 1 -Survey as well 
as the more extended configurations of the JVLA, we will only 
illustrate the case of Bmox = 20 km. The single pixel feed sensi- 
tivity specification calls for Ts vs /J]a = 44 K above about 1 GHz. 
Relevant values for use in eqn. [60] might be T^ om — 28 K and 
tja = 0.7. The mechanical pointing errors assumed above will 
apply with P « 10 arcsec and a correlation timescale t> » 15 
minutes, as well as the near-in sidelobe response of es ~ 0.01. 
A beam squash of eg « 0.04 and multi-path modulation of 
£b ~ 0.01 with lc ~ 7 m such as discussed for MeerKAT are also 
likely to be relevant here. A far sidelobe response of rjf = 0.2 is 
assumed to apply for the likely dish illumination quality. 

The thermal noise is just sufficient to permit self-cal con- 
vergence on the "natural" solution interval at the bottom end 
of the frequency coverage, but exceeds that limit above about 
600 MHz. A full track continuum observation will require self- 
calibration and will need to achieve high precision source mod- 
eling in the main beam, varying between about 0.3 and 3% from 
the bottom to the top of the band. The beam squash effects must 
also be actively modeled in order to permit the thermal noise 
limit to be achieved. Deep integrations would only approach 
the source confusion limit if a range of additional effects were 
effectively modeled, including the near-in sidelobes and main 
beam modulation. Full track spectral line observations will re- 



quire self-calibration, but are only moderately sensitive to the 
source modeling precision. 

3.6. LOFAR High Band 

The LOFAR array dHeald et all 201 lh consists of stations dis- 
tributed over much of the Netherlands as well as in other 
European countries. For the purpose of our analysis we will con- 
sider only the LOFAR High Band that covers frequencies be- 
tween 120 and 240 MHz and the stations within NL. It will be 
useful to consider both the core configuration of N = 48 stations 
of d — 30.75 m diameter for which Bmox = 3.5 km and a core 
plus remote configuration with a total of N = 64 stations and 
Bmqx = 121 km. While the remote stations have a larger physi- 
cal size than those in the core, we will assume that only the cen- 
tral d = 30.75 m of these outer stations is utilised to simplify 
their combination with the core stations during imaging. The 
configuration of stations is highly centrally concentrated with 
a baseline density that declines roughly as B 2 beyond about 
100 m. For the core configuration we will assume BMed = 250 m 
while for the core plus remote configuration this might increase 
to B Med = 1 km. The SEFD of the LOFAR High Band core sta- 
tions is projected to vary approximately as, 

SEFD » 2150 + 0.215( — V — - 175^1 Jy. (61) 
\ 1 MHz / 

Mechanical pointing errors clearly do not apply to an aperture 
array. There may well be an electronic pointing error associated 
with gain fluctuations of the individual antenna elements that are 
combined to produce the station beam. Actual values are not yet 
documented but the impact will be assesed by assuming a station 
beam variability amplitude of about e' p = 0.01 and timescale of 
t' p — 1 minute. As was the case for the phased array feed, we 
expect beam squint and beam squash to be minor, if there is op- 
timisation for these parameters. However, the minimisation of 
beam squash for an aperture array will require active compen- 
satation of the geometric foreshortening of the stations for the 
duration of the observing track. While this can be realised it will 
come at the expense of sensitivity, since the "lowest common 
denominator" approach for the effective aperture would need to 
be applied. An alternative to this straightforward strategy would 
be the use of the "A-projection" algorithm, as outlined previ- 
ously, to attempt to compensate for the changing, assymetric sta- 
tion beams during an observation. As noted above, this method 
only provides approximate "forward" correction of the acquired 
data and is very sensitive to the degree of instantaneous (w, v) 
coverage. Experience on the applicability and precision of this 
method needs to be acquired and documented. A reference value 
of €q = 0.01 will be assumed to track the impact of any resid- 
ual unmodeled beam assymetries. As noted at the outset of this 
section we will assume that a full track observation has tj — 4 
hour duration to limit the severity of geometric foreshortening. 
Main beam modulation due to multipath effects is clearly not 
relevant to the aperture array geometry, eliminating the eg com- 
ponent. The near-in sidelobe amplitude is also open to optimi- 
sation with the choice of digital tapering of the station beam 
weights. However, a lower sidelobe will come at the expense 
of station sensitivity. For illustration purposes, we will assume a 
uniform aperture weighting to achieve maximum sensitivity, al- 
though this yields a large value of es =0.1. Measured HBA side- 
lobe levels are somewhat greater than 10% and are highly mod- 
ulated with azimuthal angle. The magnitude of the far-sidelobe 
response of the LOFAR stations is not yet documented. In view 
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Fig. 18. The noise budget for the SKAl-Survey telescope as in Fig.fTTI 
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Fig. 20. The noise budget for the LOFAR High Band Core telescope as in Fig.fTTI 



of the discrete rather than continuous nature of the aperture sam- 
pling, a relatively large value of r\p « 0.5 might be appropriate. 

The LOFAR Core configuration noise budget is shown in 
Fig-GO] At the nominal solution timescale, the thermal noise eas- 
ily satisfies the self-cal signal-to-noise requirement, crj < crsoi- 
However, the far sidelobe response to both the daytime, cr c , and 
the nighttime sky cr N , completely dominate the noise budget by 
several orders of magnitude. Self-calibration will not be pos- 
sible without very extensive modeling of the sky for a nomi- 



nal solution interval (t*, Ay*). Comparison with Fig. |2]demon- 
strates how deep it will be necesssary to go before these con- 
tributions will no longer dominate the fluctuation level on a 
solution interval. The first order of magnitude reduction will 
be addressed by adequately modeling the 30 brightest sources 
(per 2n sr) with S\aghz > 30 Jy. Unfortunately, even this may 
not be sufficient below about v =180 MHz. The next flux bin, 
10 < StAGHz < 30 Jy, contains an additional 70 sources per 
hemisphere. With this reduction in fluctuation level (by about a 
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factor of 17) self-calibration should become practical over the 
entire band. The other method for reducing the far sidelobe fluc- 
tuations within a solution interval might be extensive data av- 
eraging in time and frequency. As can be seen from eqns. Q~3] 
and[14] increasing each of these by an order of magnitude would 
provide a factor of ten reduction in the fluctuation amplitude, 
although at the expense of a major negative impact on the abil- 
ity to model sources on the main beam flank with precision. In 
practise a hybrid approach might be employed whereby some 
number of the brightest sources are modeled directly and the re- 
mainder are suppressed by a degree of visibility averaging. The 
full-track noise budget plot demonstrates that the LOFAR Core 
will be source confusion limited for continuum observations af- 
ter only short integrations. Of potential concern is the fact that 
the thermal noise is exceeded by the majority of other effects 
under consideration. 

The full-track spectral line observation demonstrates the 
challenges that will be f aced by the Epoch of Reionization ex- 
periments (Zaroubi et al. 2012) Self-calibration will be neces- 
sary to permit the thermal noise floor to be reached. However, far 
sidelobe contributions vastly exceed the thermal noise and will 
require extensive modeling to achieve the necessary 20 to 30 dB 
reduction. This requirement is significantly more stringent than 
that encountered on the self-cal solution timescale and will be 
very challenging to meet in the lower half of the band, where 
100's of sources distributed over the entire sky must be modeled. 
Unmodeled sources in the near-in sidelobes also lead to excess 
image noise that exceed cr T by factors of several in each four 
hour track. As noted previously, the near-in sidelobe pattern with 
uniform aperture weighting is highly modulated with parallactic 
angle, implying that this time variability will require active mod- 
eling. Since both cr N and cr s are tied to the intrinsic telescope- 
sky response they will not average down in long integrations. 
The ultimate noise floor in a deep field observation will be en- 
tirely limited by the quality of the sky modeling. It may well 
be deemed beneficial to employ significant tapering of the sta- 
tion aperture to reduce the near-in sidelobe level. Unfortunately, 
such aperture tapering may have the undesirable side-effect that 
the far sidelobe response is further enhanced, since the effective 
station diameter, d, is reduced. 

The noise budget for the combined Core plus Remote config- 
uration is shown in Fig. |2T] The nominal solution timescale for 
this much larger value of Bmm does not provide adequate signal- 
to-noise for self-calibration. A substantial degree (factor 10) of 
time and frequency averaging is necessary to bring cr T down be- 
low osoi which will have an adverse effect on source modeling 
within the main beam. Such averaging will provide attenuation 
of the far sidelobe contributions by an order of magnitude. Even 
so, a further reduction of <x D or cr w by at least another order of 
magnitude is needed to allow for the possibility of self-cal con- 
vergence. As discussed above, a minimum requirement will be 
modeling and excision of the S \aghz > 30 Jy source category. 
The full-track continuum noise budget shows that source con- 
fusion noise has now become negligible relative to other terms. 
Unfortunately, there are many error contributions that exceed the 
thermal noise in a single track and self-calibration is clearly nec- 
essary to approach that limit. We will assume that the far side- 
lobe contributions have been reduced from the outset by about 
a factor on ten through modeling of the S \aghz > 30 Jy popu- 
lation. The next limitation becomes that of unmodeled sources 
in the near-in sidelobes, followed by modeling errors associ- 
ated with main beam sources. Reaching the thermal noise in a 
four hour track will require a source modeling precision that ex- 
ceeds 0.3%. This will be particularly challenging in view of the 



high degree of visibility smearing (factor 10) required to pro- 
vide solution signal-to-noise. The modeling of the time variable 
response to sources within the near-in sidelobes will also need 
to achieve a 1 % acccuracy, in order to avoid this potential lim- 
itation. It is likely to be very challenging to reach the thermal 
noise in such four hour tracks. Even deeper continuum integra- 
tions would need to improve on all of the previously mentioned 
error contributions. 

The full-track spectral line case is significantly less challeng- 
ing than the continuum for the Remote configuration, although 
again self-calibration is required across the entire band. Once the 
far sidelobe effects have been reduced to the point where self-cal 
becomes possible, the primary limitation come from adequate 
modeling of sources within the near-in sidelobes. This must be 
reduced by about a factor of two to reach cr T . Deeper spectral 
line integrations will need to improve further on the <x s com- 
ponent and may then also need to improve on the far sidelobe 
contributions cr N . 

3. 7. Murchison Wide-field Array - MWA 

The MWA (Lonsdale et al. 2009) is currently expected to consist 
of N = 128 station tiles, each a square of 4.4x4.4 m dimensions 
that cover the frequency band 80 - 300 MHz. The configuration 
is planned to be centrally concentrated with Bmox = 3 km and 
Buied ~ 300 m. The system temperature is expected to vary as, 

T Sys * Tjto + 60Aj, 55 K, (62) 

in terms of a receiver temperature, Tr cv and the wavelength, A m 
in meters. While the receiver temperature specification is not 
currently published, we will assume Tr cv = 150 K as a represen- 
tative value. The effective collecting area per station is expected 
to vary as, 

A ef f = 16^yJ (A < A T ) (63) 

= 16^ J (A>A T ) m 2 

for 16 dual polarisation antenna elements in each tile that are 
sparsely distributed for A < At = 2.2 m. We will assume an 
electronic pointing error associated with gain fluctuations of the 
individual antenna elements of about e' p = 0.01 and timescale 
of t' p — 1 minute. The rectangular tile geometry will give rise to 
large departures from axisymmetry for the main beam shape. We 
will also assume a residual beam squash error of eg =0.1 that 
applies to the full track observations of tt =4 hour duration. 
Untapered aperture weighting is assumed that might yield a near- 
in sidelobe contribution ej =0.1. The far sidelobe response will 
be characterised by T]f « 0.5. 

The noise budget plot of Fig. [22] demonstrates the regime 
that this facility will operate in. In view of the extremely large 
field of view of each tile, that varies between 200 and 3000 deg 2 
across the band, this is a case where the "quiet" sky approxi- 
mation is clearly not relevant and equation [T9l rather than [18] is 
employed in calculating noise terms. Far sidelobe pickup of the 
sky will dominate over all other contributions by two to three or- 
ders of magnitude on a natural solution timescale. Even full track 
continuum and spectral line observations are completely domi- 
nated by this all-sky response to a similar degree. Comparison 
with Fig.[2]suggests that even the S iachz < 0.1 Jy population is 
contributing to visibility modulations in this regime. Modeling 
of the time variable response to each of 10,000+ sources on the 
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Fig. 21. The noise budget for the LOFAR High Band Remote telescope as in Fig.fTTI 
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Fig. 22. The noise budget for the MWA telescope as in Fig. [TT] 



sky does not seem likely to be practical. Even the 8000 instan- 
taneous visibilities are less in number than the number of rel- 
evant sources on the sky, without considering any instrumental 
variables. Traditional self-calibration is not a viable option in 
this circumstance, although it appears to be essential to provide 
some improvement to the noise budget for both continuum and 
spectral line observations. A telescope operating in this regime 
will need to rely on the quality of external calibration that can 
be acheived and seems unlikely to achieve thermal noise limited 
performance. 

3.8. SKA 1 -Low 

The Square Kilometre Array Phase 1 low frequency aperture ar- 
ray is currently expected to consist of 50 stations of d — 180 m 
diameter operating between 70 and 450 MHz. The array config- 
uration is planned to consist of 25 stations within a diameter of 
1 km, a further 10 in the annulus of 5 km diameter and a final 
15 distributed out to perhaps 100 km. The system temperature is 
expected to vary as, 



150 + 60/t" 5 K, 



(64) 



in terms of the wavelength, A m in meters. The effective collecting 
area per station is expected to vary as, 



l 'ff 



I A 2 
1 12001 — 



( A < A T ) 



(65) 



= I1200(y 



(A > A T ) m 



given the 1 1200 dual polarisation antenna elements in each sta- 
tion that are sparsely distributed for A < At — 2.6 m. Many of the 
same considerations discussed previously for the LOFAR anal- 
ysis apply here as well. We will assume an electronic pointing 
error associated with gain fluctuations of the individual antenna 
elements of about e' p = 0.01 and timescale of t' p — 1 minute. We 
will also assume a residual beam squash error of eg = 0.01 that 
applies to the full track observations of tt = 4 hour duration. As 
a starting assumption we will assume untapered aperture weight- 
ing that yields a near-in sidelobe contribution e$ =0.1. The far 
sidelobe response will be characterised by t]f « 0.5. Just as done 
for the case of LOFAR we will consider both a core configura- 
tion with N = 35, Bmox = 5 km and BMed - 500 m, as well 
as an extended configuration with N = 50, Bm B x = 100 km and 
B Med = 2500 m. 

The noise budget for the SKAl-Lowcoreis shown in Fig. [23] 
Although the thermal noise is well below the self-cal solution 
limit on the natural timescale, there are a number of other con- 
tributions that exceed cr T by large factors. Far sidelobe pickup 
is chief among these, followed by near-in sidelobe contribu- 
tions and source modeling errors. The brightest 10-20 dB of the 
nighttime sky will likely require modeling to allow useful self- 
calibration to be undertaken. Full track continuum performance 
is completely source confusion dominated, while the spectral 
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line case is analogous to that seen on the solution timescale. Self- 
cal will be required to approach the thermal noise, but far side- 
lobe modulations dominate by two to three orders of magnitude 
over the thermal noise, followed by near-in sidelobe sources and 
source modeling errors. A source modeling precision of about 
0.3% is required, once the <tn and cr s contributions have been 
reduced to the point that they no longer dominate. Adequate 
modeling of the nighttime sky will be quite challenging, since 
its contribution will need to be reduced by about 20 dB below 
about 120 MHz. As noted previously, this will entail modeling 
down to the S \aghz < 0.1 Jy level, where more than 10 4 sources 
are above the horizon at any moment. 

The noise performance of the extended SKA 1 -Low array is 
shown in Fig. [24] While some far sidelobe modeling or data 
averaging is needed to enable self-calibration, this is less ex- 
treme than for the more compact core. The full track continuum 
performance faces a wide range of practical challenges. All of 
the effects under consideration exceed the thermal noise below 
200 MHz; the most severe being unmodeled sources in the near- 
in sidelobe which must be reduced by some 20 dB. The full track 
spectral line case is somewhat less challenging, although some 
10 dB are required here as well. Self-calibration is required for 
both continuum and spectral line applications. 



4. Discussion 

4.1. Dishes 

Consideration of the various telescope systems in Section [3] al- 
lows some general conclusions to be drawn. The "traditional" 
25 m diameter class dishes of existing synthesis radio telescopes 
operate in a regime where the thermal noise of the receiver sys- 
tem dominates the noise budget on a self-cal solution interval, 
typically by a large margin. Furthermore, at least for relatively 
compact configurations, there is adequate signal-to-noise pro- 
vided by sources occurring randomly within the field to allow 
self-calibration to be undertaken at close to the "natural" so- 
lution interval that keeps time and bandwidth smearing effects 
at a small fraction of the synthesied beamwidth over the entire 
primary beam field of view. This constitutes the most advan- 
tageous of circumstances, since simple imaging strategies can 
be employed that use only a single version of the data, in con- 
trast to more complex strategies that might require simultaneous 
processing of multiple phase centres. Even so, full track con- 
tinuum observations must overcome several potential obstacles, 



at the 10 dB level, before thermal noise limited performance is 
achieved below 2 GHz. This is consistent with practical experi- 
ence with the JVLA, the WSRT and the ATCA where consider- 
able care must be excercised with precise modeling of sources 
that occur within the field and in the case of the JVLA, explicit 
modeling of the non-axisymmetric beam shape and its rotation 
on the sky during tracking, to keep these effects from becom- 
ing dominant. Above about 2 GHz and in more extended array 
configurations it becomes necessary to introduce more data aver- 
aging prior to self-calibration than is optimum for simple imag- 
ing strategies. However, only a limited number of complications 
need be addressed to achieve thermal noise limited performance 
in full-track continuum observations in these circumstances. The 
full-track spectral line performance of such arrays is less subject 
to complications and only requires self-calibration to reach the 
thermal noise floor below about 2 GHz. Only in the most com- 
pact configurations and at frequencies approaching 1 GHz does 
the Sun begin to present some problems. 

This is in stark contrast to the situation encountered with syn- 
thesis arrays that employ significantly smaller dishes. The most 
striking comparison is with the ATA, where the combination of 
small dish diameter and compact array configuration yield a very 
different regime. For a nominal self-cal solution interval, the 
thermal noise is dominated by the far sidelobe response to the 
entire sky. The same is true even after integrating over an entire 
12 hour track, both for spectral line and continuum applications. 
Continuum observations are further challenged by a wide range 
of other noise contributions which dominate over the thermal 
noise. Its not clear whether high dynamic range imaging in this 
regime will be possible. 

The case of 12 to 15 m class dishes in more extended con- 
figurations is improved over that with only 6 m dishes. Even 
so, more than nominal data averaging is required to provide the 
signal-to-noise needed for self-calibration with this dish size, 
which will add complexity to wide-field imaging. Higher source 
modeling precision than what has been needed for the 25 m 
VLA dishes, by about a factor of two will be required, together 
with explicit modeling of any main beam asymmetries in cases 
where the parallactic angle is not fixed on the sky during an ob- 
servation. In this respect, the ASKAP telescope benefits from 
both the greater circular symmetry of polarisation beams pos- 
sible with a phased array feed as well as the polarisation rota- 
tion axis that keeps the beam orientation fixed on the sky during 
tracking. Single pixel fed systems with an (alt, az) mount will 
face significantly greater challenges to reach the thermal noise 
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floor. Deep field continuum observations will need to achieve 
extremely high source modeling precision that extends out into 
the near-in sidelobe region. 

Exactly the same considerations apply to the SKA, if 15 m 
class dishes are deployed as envisaged within the current base- 
line design. Achieving thermal noise limited continuum perfor- 
mance will be more challenging than with the 25 m dishes of the 
JVLA and sub-yt/Jy imaging would need to overcome a whole 
range of complex modeling issues. One of the greatest threats to 
thermal noise-limited performance is the departure from circu- 
lar symmetry of the polarisation beams and the rotation of this 
pattern on the sky. The use of phased array feeds provides one 
means of reducing the risk associated with beam asymmetries, 
while the adoption of an equatorial or (alt , az, pol) mount sys- 
tem provides another for eliminating their time variability. 

Adoption of a larger dish diameter would provide a signif- 
icant risk reduction and this option should be considered very 
seriously during cost/performance optimsation of the final SKA 
dish design. 

4.2. Aperture Arrays 

The aperture arrays considered in Section[3]display many of the 
same phenomena. The small station sizes of LOFAR (30.8 m) 
and particularly the MWA (4.4 m) result in visibilities that are 
completely dominated by noise-like modulations from sources 
appearing in the far sidelobe response. Very extensive modeling 
of the visibility response to these distant sources becomes nec- 
essary before self-calibration within the main beam can be con- 
templated. A major additional complication is introduced by the 
geometric foreshortening of each station aperture while track- 
ing a source. While an absence of moving parts is often cited 
as a major advantage of these systems, the continuously chang- 
ing response of the sky-telescope system is the price that is paid. 
Although efforts are underway to impro ve predictive modeling 
of both the far sidelobe response patterns ( Wiinholds et al. 2010) 
as well as compensation for the time-variable main beam shape 
dRau et al.l 20091) it is not yet clear what level of success will 
be achieved with either of these issues. While the scope of this 
problem is daunting for LOFAR, where some 20 dB of reduc- 
tion is needed to approach the thermal noise on a solution in- 
terval, it is even more extreme for the MWA where 30-40 dB 
are needed. Although self-calibration seems to be required to 
reduce the magnitude of gain calibration errors it does not ap- 
pear to be a viable option for the MWA. The source popula- 



tion responsible for residual visibility errors is so large, con- 
sisting of more than 10 4 relevant sources above the horizon at 
any moment, that it may not prove practical to model the time- 
dependent system response to these sources at all. This may 
be an intrinsically under-determined problem with the finite in- 
formation available within the solution interval over which the 
instrum ental response is tim e-invariant. While it has been sug- 
gested (Morales & Mateiek 2009) that alternate calibration and 
imaging strategies might be utilised to circumvent these chal- 
lenges, the fundamental question of sufficient independent data 
constraints on a solution timescale has not been demonstrated in 
either theory or practise. 

The SKA 1 -Low concept calls for much larger aperture ar- 
rays, of 180 m diameter, to combat the problem of far sidelobe 
response. While this approach helps, it is still the case that many 
noise contributions exceed the thermal noise, both on a self-cal 
solution interval as well as in full-track spectral line and contin- 
uum images. Some 30 dB of attenuation is necessary for a wide 
range of phenomena to approach the thermal noise in a single 
4 hour track. This will be extremely challenging to overcome, 
since even the 10 dB level issues faced by the VLA 25 m dishes 
at GHz frequencies have required many years of algorithm de- 
velopment to partially address. While large station diameters 
seem to offer the best prospects, it will still require very signifi- 
cant progress in both algorithm development and efficient com- 
putational implementations over what is available currently to 
approach thermal noise limited performance for v < 200 MHz. 
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